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Method and System for Market Based Resource Allocation 



Background 

A. Technical Field 

This application relates to a system and method for real-time resource allocation 
and, specifically, to a system and method for allowing real-time buyers and sellers to bid for 
resources and controlling the applicable resources in accordance with the bids. 

B. Background of the Invention 

Over the past several years, the Internet has become an important mechanism for 
conducting business. It has helped reduce business cost and enabled the customized 
delivery of goods and services. To facilitate exchange of goods and services, electronic 
commerce technologies have been developed, ranging from simple Internet shopping sites 
to auction sites such as Yahoo (http://www.yahoo.com) and eBay (http ://www. ebay . com) . 

At the same time, a small but growing market for telecommunication network 
bandwidth has developed. Bandwidth, however, is unlike traditional commodities, goods 
and services, in that: 1) it is a shared resource that cannot be stored. As such, any 
bandwidth capacity unused by one party can be made available to all other parties having 
access to it or it is lost. 2) The resource of bandwidth is consumed in real time. That is, it 
can occur simultaneously with the buying of the resource. 3) Demand for bandwidth 
fluctuates very rapidly and tends to be elastic. 

Recently, the need for a dynamic bandwidth commodity market has been 
recognized. Bandwidth exchanges have been developed between new network companies 
and Internet service providers. The old system of signing lengthy contracts after weeks or 
months of negotiation does not move fast enough for buying and selling of many Internet- 
related resources, such as bandwidth. 

The allocation of information service resources can be viewed as an exchange of 
commodities. In particular, in the case of bandwidth, Dow Jones has announced plans to 



1 

SUBSTITUTE SHEET (RULE 26) 



WO 01/88811 PCT/US01/15424 

launch a bandwidth index that will provide price for long-haul data routes and will enable 
companies to use these figures to peg the changing process for contracts when they are 
buying access to networks. 

The emerging telecommunications market call for new mechanisms for real-time 
5 trading of network resources, such as bandwidth. In a convention human-based resource 
trading system, resources for sale are posted, such as on a bulletin board or similar online 
location. Human beings peruse the posted resources and try to decide which resources their 
organizations will need in the future. The humans then place bids against each other for the 
resources. Such human-based resource trading systems usually operate on large amounts of 
10 bandwidth at a time and trades are performed periodically, the period being fairly long due 
to the limits of human attention span and speed. For example, it would be impractical for a 
human-base trading system to buy and sell resources in one minute or five minute partitions, 
since human beings are not capable of such speed. In addition, most human-based resource 
agents rely on additional human interaction to implement the results of the resource bidding. 

15 

Summary of Embodiments of the Invention 

The present invention provides a platform for resource allocation in real-time. One 
or more software resource agents interact with software player agents, which are usually 
both s and seller agents, to reach an agreement on price and quantity allocations for each 

20 buyer of that resource (for example, X Mbs of bandwidth for Y units of time, at price Z for 
buyer A). The s operate in accordance with one or more strategy rules for that agent. A 
strategy rule tells the what strategy to use in bidding against other agents for particular 
resources. The s also operate in accordance with valuation rules that tell the how to value a 
particular resource when bidding (this value is often used as a part of the strategy rule). 

25 Seller agents also contain their own strategy and valuation rules, which allow them to 

decide how much of a resource to offer and how to set a minimum price for the resource. 
Both player agents (buyer and sellers) and resource agents are aware of a global allocation 
rule used by the resource agent to allocate a resource between the buyers. In the buyer and 
seller agents, this allocation rule is often considered in determining strategy. 
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In the described embodiment, player agents (buyer and seller) also contain a 
graphical user interface that allows human beings to set their rules and to control various 
aspects of the player agent. 



SUBSTITUTE SHEET (RULE 26) 



WO 01/88811 



PCT/US01/15424 



In general, the present invention promotes the sharing of a limited resource, such as 
bandwidth, buffer space, memory space, storage, or processor time, in a competitive 
environment. This environment ensures that whomever need the most resource and has the 
ability to pay will get a share of the resource in accordance with willingness to pay. In 
addition, the invention adjusts for the changing needs of the participants over time. 

Advantages of the invention will be set forth in part in the description which follows 
and in part will be apparent from the description or may be learned by practice of the 
invention. The objects and advantages of the invention will be realized and attained by 
means of the elements and combinations particularly pointed out in the appended claims and 
equivalents. 

Brief Description of the Drawings 

Fig. 1 is a block diagram of an embodiment of the present invention. 

Fig. 2 is a flow chart of a method performed by a resource agent of Fig. 1. 

Fig. 3 is a flow chart of a method performed by a of Fig. 1. 

Fig. 4 is a flow chart of a method performed by a seller agent of Fig. 1 . 

Fig. 5 is a chart showing an example of the result of an allocation rule. 

Figs. 6(a) and 6(b) show an example of a valuation rule. 

Fig. 7(a) and 7(b) show an example of a strategy rule. 

Fig. 8 shows an example of a flow chart used by an accounting system of an 
embodiment of the invention. 

Fig. 9(a) is a more detailed block diagram of the embodiment of Fig. 1. 

Fig. 9(b) shows an example of how a bandwidth resource is given to the winning 

bidder. 

Fig. 10 shows an example of an embodiment of the present invention including a 
"garage" for player agents. 

Figs. 1 l(a)-l 1(b) are an example of an XML file for a generic player agent. 

Figs. 12(a)- 12(c) shown an example of Java code implementing a strategy rule for a 

Figs. 13 (a)- 13(c) show an example of Java code implementing an allocation rule for 
a resource agent 104. 

Fig. 14 is an example of an XML file for a generic resource agent. 

Figs. 15(a)-15(r) show examples of user interfaces for buyer and seller agents. 

3 
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Detailed Description of Embodiments 

Reference will now be made in detail to several embodiments of the present 
invention, examples of which are illustrated in the accompanying drawings. Wherever 
5 practicable, the same reference numbers will be used throughout the drawings to refer to the 
same or like parts. 

The present invention uses distributed, self-optimizing software agents to perform 
resource allocation more efficiently than centralized or human-based systems. 

Fig. 1 is a block diagram of an embodiment of the present invention. Fig. 1 includes 

10 a software player agent 102, which represents multiple and seller agents. A typical system 
will, include both buyer and seller agents 102. Fig. 1 also includes a software resource 
agent 104, a software accounting agent 106, a network control and management agent 108 
and a resource 110. It is contemplated that resource 110 can be a number of different 
resources, including but not limited to: bandwidth, buffer space, memory space, storage, or 

15 processor time. As shown in Fig. 1, each player agent (buyer and seller agent) can contain a 
Graphical User Interface (GUI) 122, one or more valuation rules 124, one or more strategy 
rules 126, and one or more allocation rules 128. The resource agent 104 also contains the 
same one or more allocation rules 128. 

s 102 place bids to the resource agent 104, which ultimately decides which of the 

20 player agents is awarded a portion of each resource for a predetermined amount of time. 

Resource agent 104 also controls resource 110, based on the results of the bidding, 
by sending an allocation command to network control and management agent 108. Thus, 
for example, if the resource is Internet bandwidth, if buyer A has won X Mbs of bandwidth 
for 5 minutes, resource agent 104 directs the network control and management agent 108 to 

25 give precedence to packets originating at buyer A for the next five minutes or to ensure that 
the agreed upon bandwidth requirements are met for buyer A's packets for the next five 
minutes. The mechanism used to communicate the allocation command from resource 
agent 104 to agent 108 can be any appropriate mechanism. In the example shown, the 
allocation command will include an identification of the winning buyer or buyers and an 

30 identification of the amount of resource allocated and the time period for which it is 

allocated. Other appropriate formats can be used without departing from the spirit of the 
invention. In some embodiments, agents 104 and 108 are separate agents (as shown in the 
figure). In other embodiments, their functions are merged into a single agent. Agents 104 
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and 108 can be owned and/or controlled by the same entity or by different entities. For 
example the owner of resource 110 could outsource the market-based allocation to a 
business partner operating resource agent 104, while keeping the network management and 
control function in its own hands, by retaining operation of the network control and 
5 management agent 108 themselves. 

In the embodiment shown, network control and management agent 108 controls 
resource 1 10 to implement the allocation command received from resource agent 104. 
Thus, agent 108 commits the resource allocated to a player after the resource agent 104 has 
closed the bidding. Network control and management agent 108 provides a common 

10 interface for resource agent 104 for all systems supported. In certain embodiments, there is 
one agent 108 per resource. Alternately, a single agent can control multiple resources and 
communicate with multiple resource agents 104. In the embodiment shown, network 
management and control agent 108 controls resources belonging to other entities (i.e., not to 
the owner of resource agent 104). In other embodiments, resource 110 might also be under 

15 the direct control of the owner of resource agent 104. 

Fig. 1 shows that network control and management agent 108 sends either Simple 
Network Management Protocol (SNMP) commands or COPS (Common Open Policy 
Service Protocol) commands to the resource. The SNMP protocol is well known and is 
described in RFC 1089, RFC 1270, RFC 1303, RFC 1298, RFC 1418, and RFC 1419, 
20 which are incorporated herein by reference in their entirety. The COPS protocol is well 
known and is described in "The COPS (Common Open Policy Service," dated March 5, 
1999, available from Adobe Systems, Inc. of San Jose, CA, and RFC 2748., both of which 
are incorporated herein by reference in their entirety. Other appropriate resource control 
protocols can be used without departing from the spirit of the invention. 

25 Once one or more winning buyers are determined, resource agent 104 alerts 

accounting agent 106, which keeps track of the winning buyers, as described below in 
connection with Fig. 9(a). Accounting agent 106 provides a common interface for all 
accounting systems supported. Agent 106 preferably contains a database handler for each 
system it supports. Although accounting agent 106is shown as using one or more of the 

30 IHN and SQL database interfaces, any appropriate interface to an accounting database could 
be used. 

5 
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The 102 operates in accordance with one or more strategy rules for that agent. A 
strategy rule tells the what strategy to use in bidding against other agents for particular 
resources and, therefore constitutes a bidding mechanism for agents. 

The s also operate in accordance with valuation rules that tell the how to value a 
5 particular resource when bidding (this value is often used as a part of the strategy rule). 
Thus, a valuation rule typically tells an agent how to value each unit of a resource over a 
range of possible quantities, at a given time. Seller agents also contain their own strategy 
and valuation rules, which allow them to decide how much of a resource to offer and how to 
determine a minimum price for the resource. Strategy and valuation can depend on external 

10 information, such as accounting information and network congestion. 

The valuation and strategy of player agents (buyer and sellers) and resource agents 
are aware of a global allocation rule used by the resource agent to allocate a resource 
between the buyers. In the buyer and seller agents, this allocation rule is often considered in 
determining valuation and/or strategy. Thus, an allocation rule typically can be thought of as 

15 corresponding to a market mechanism. Examples of allocation rules include English 

auctions (familiar to persons who frequent human-based antique and estate sale auctions), 
Reverse Price Auctions (such as the main eBay model), Dutch auctions, and continuous bid- 
ask trading. Examples of allocation rules are found, for example, in 1) J.F. Rosenschein 
and G. Zlotkin, "Rules of Encounter," MIT Press 1994; 2) PhD thesis of N. Semret, 

20 "Market Mechanisms for Network Resource Sharing," Dept. of Electrical Engineering, 

Columbia University, submitted approximately May 1999; and 3) H.R. Varian, "Economic 
Mechanism Design for Computerized Agents," USENEX Workshop on Electronic 
Commerce, July 1995. Each of these three references is incorporated herein in its entirety. 
Fig. 2 is a flow chart of a method performed by resource agent 104 of Fig. 1 . In at 

25 least certain embodiments, there is more than one resource agent - one for each resource. 
For example, if an ISP offers several different bandwidths, each bandwidth might be 
considered a separate resource, controlled by a separate resource agent 104. Resource 
agents preferably run within servers that can be distributed over a network. Alternately, one 
or more resource managers 104 can reside on the same system. 

30 As shown in Fig. 2, resource agent 104 receives 202 one or more bids for resource 

110 from one or more s 102. Such bids will usually include at least quantity and price 
values. If a buyer is outbid, the resource agent notifies 204 the buyer, unless the trading 
period is over 206. Once the trading period is over, the resource agent notifies 208 the 
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winning or agents and proceeds to send an allocation command 210 to network control and 
management agent 108 that will cause agent 108 to control the resource in accordance with 
the winning bids. 

Fig. 3 is a flow chart of a method performed by player/buyer 102 agent of Fig. 1. 
5 First, the is apprised of potential resources available for bidding. This is accomplished 
either by the player requesting current resource auction information from a centralized 
directory service (not shown) or by the player registering with the resource agent and 
periodically being sent information about current bidding. With a directory service, a player 
agent 102 queries the directory service to find the location (e.g., in the form of a URL) of 

10 resource agents 104, garages (see Fig. 10), and other player agents 102. The decides 302 
whether to bid on a particular unit or resource 110 and sends 304 a bid to the resource agent. 
If the receives a notice that he has been outbid, control returns to element 302 and the agent 
decides whether to continue bidding. 

In element 302, the uses its knowledge of the system market allocation rule and of 

15 its own valuation rules to determine how much it is willing to pay for resources at a given 
time and uses its own strategy rules to determine whether to bid on available resources. If 
the includes a GUI, a human being can visualize the market using various known graphs, 
charts or similar graphics. A user can also use the GUI to change the strategy and valuation 
rules used by the agent. 

20 It should be noted that certain embodiments include a special type of player/, called 

a broker agent, which buys resources with the intent of reselling those resources.. Thus, for 
example, a broker agent has no user for the resource (such as bandwidth) itself, since the 
broker is not an ISP or similar entity in need of bandwidth. Instead, the broker arranges for 
third party customers, such as ISPs, to receive the benefit of the resources the broker has bid 

25 for and won, by reselling them through another resource agent instance, and the broker 
keeps the difference between the buying and selling price.. In such a case, the new 
resource agent 104 informs the network control and management agent 108 which party is 
to benefit from the resource bid for by the broker. Fig. 4 is a flow chart of a method 
performed by a player/seller 102 agent of Fig. 1. A seller agent first notifies 402 the 

30 resource agent that it has one or more units of a resource to sell (for example, IMbs for 5 
minutes or 10 minutes of processor time). Once the bidding is over, seller agent 102 will 
receive from resource agent 104 a notification 404 of the winning bidder or bidders. In at 

7 
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least one embodiment, the seller agents collect the payments from the buyers based on 
accounting information retrieved from the accounting agent. 106. 

Fig. 5 is a chart showing an example of the result of a Progressive Second Price 
Auction (PSP) allocation rule. Other allocation rule types, as discussed above, can also be 
5 used, as specified by the particular allocation market strategy implemented in a particular 
system. Fig. 5 shows how resources are allocated and prices set once the bidding is over 
and it is determined that there are not enough resources to satisfy all the bidders. 

In the PSP auction of Fig. 5, bidder A bids $3 for 50 resource units; bidder B bids $2 
for 30 resource units; bidder C bids $1 for 30 resource units; and bidder D bids $0.50 for 20 
10 resource units. At the close of bidding, resource agent 104 allocates the resources as 
follows: The 100 available resource units are apportioned between the bidders until the 
resource is gone. Thus, the high bidder A is allocated all of the resource that he wants, as is 
the second high bidder B. Bidder C only gets 20 or the 30 resource units that he wanted and 
bidder D gets nothing, since there are no more resource units to allocate after partially 
1 5 fulfilling bidder C 's wants. 

The resource agent 104 determines the amount that the bidders in the PSP auction 
are charged as follows: For each bidder, the resource agent determines the value of the 
bidder's resource if that bidder had not participated, and charges the bidder a price based on 
this determination. 

20 Thus, if bidder A had not participated, his 50 units would have been allocated as 

follows: 

10 units to bidder C (to make up C's shortfall) 
20 units to bidder D (to give D his requested number of units) 
The remaining 20 of bidder A's units would not have been allocated. 
25 The cost to bidder A is determined as follows: 

The cost of a resource unit to bidder C is $1/30. 

Thus, the value that would have been given to bidder C: 10 x ($1/30) = $0.33 



The cost of a resource unit to bidder D is $0.50/20. 
30 Thus, the value that would have been given to bidder D: 20 x ($0.50/20) = 

$0.50 
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Thus, the cost of A's resources had A not participated is $0.33 + $0.50 = $0.83. 
Bidder A is charged this amount for his 50 units of resource. 

Similarly, if bidder B were not present, his 30 units would have been allocated as 
follows: 

5 10 units to bidder C (to make up C's shortfall) 

20 units to bidder D (to give D his requested number of units) 

The cost to bidder B is determined as follows: 

10 The cost of a resource unit to bidder C is $1/30. 

Thus, the value that would have been given to bidder C: 10 x ($1/30) = $0.33 

The cost of a resource unit to bidder D is $0.50/20. 

Thus, the value that would have been given to bidder D: 20 x ($0.50/20) = 

15 $0.50 

Thus, the cost of B's resources had B not participated is $0.33 + $0.50 = $0.83. 
Bidder B is charged this amount for his 30 units of resource. 

20 Similarly, if bidder C were not present, his 20 units would have been allocated as 

follows: 

20 units to bidder D (to give D his requested number of units) 
The cost to bidder C is determined as follows: 

Value that would have been given to bidder D: 20 x ($0.50/20) = $0.50 

25 

Thus, the cost of C's resources had C not participated is $0.50. Bidder C is charged 
this amount for his 20 units of resource. 

It should be noted that Fig. 5 shows only how resources are allocated after bidding is 
closed. An allocation rule also includes within it rules or explanations of how the auction 
30 itself should be conducted. For example, a PSP auction generally lasts for a predetermined 
amount of time (for example, five minutes). While the bidding is open, resource agent 104 
collects all bids received from the s 102 and saves them in a bidlist data structure (e.g., a 
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linked list). The bidlist data structure indicates which bid is the most recent bid for each 
102. 

As bids are received from the s 102 by the resource agent 104, the resource agent 
104 transmits the bids received to the other agents 102, so that all agents know what all 
5 other participating agents are bidding. Because each 102 has knowledge of the allocation 
method, each 102 can apply its strategy and valuation rules to determine whether that agent 
is going to be allocated the resources on which it has bid. The agent, applying the 
allocation, strategy, and valuation rules, determines whether it should bid again. In a PSP 
auction, bidding usually stabilizes after a few minutes. In some embodiments, resource 

10 agent 1 04 does not hold the auction open for a predetermined time, but instead waits a 
predetermined amount of time after the bidding has stabilized to make sure that no other 
bids are received. In some cases, resource agent 104 announces to the s 102 that bidding 
will close in a certain number of minutes or seconds. In some cases, if a bid is received 
during this time period, the auction is kept open a bit longer. 

1 5 Thus, an allocation rule (known to all agents) also includes information about how 

the auction will be conducted by resource agent 104, including, for example: how long an 
auction will last, if the auction has no set time period, what are the conditions for the 
auction to close. In addition, as described above, the allocation rule includes rules 
describing how price and quantities are assigned after the auction has closed. 

20 In a preferred embodiment of the invention, auctions last 5 minutes, although other 

periods of time could be used and these periods of time could be either variable or user- 
settable. In a preferred embodiment, the bandwidth resource being auctioned during a 
current auction is allocated immediately and another auction is begun immediately. Thus, 
an auction occurs roughly every five minutes for the bandwidth that will be used by the 

25 buyers during the next five minutes. Other embodiments may not auction all bandwidth for 
immediate use. 

The PSP auction model is described further in A.A. Lazar and N. Semret, "Design 
and Analysis of the Progressive Second Price Auction for Network Bandwidth Sharing," 
Telecommunications Systems, Special issue on Network Economics, which is attached 
30 hereto as Appendix A and forms a part of this application. 

Other examples of allocation rules include, but are not limited to a Hold Option 
Auctions, which are discussed, for example, in the PhD thesis of N. Semret, 'Market 
Mechanisms for Network Resource Sharing," Dept. of Electrical Engineering, Columbia 

10 
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University, submitted approximately May 1999, Chapter 4 of which (28 pages) is attached 
hereto as Appendix B and which forms a part of this specification. The Hold Option is a 
concept for advance price and quantity guaranteed reservations of network resources in a 
real-time market environment. Periodic auctions (progressive second price, or other) among 
5 arrivals grouped in batches give rise to the spot market of capacity changes. A reservation 
guaranteeing access for an arbitrary duration with a capacity piece below the bid can be 
made at any time before or during service. This eliminates the risk (which is inherent on the 
spot market) of losing resources to higher bidders before service completion. The 
reservation is defined as a hold option, and is analogous to derivative financial instruments 

10 such as options and futures integrated over time. Based on a heavy traffic diffusion model, 
reservation fees can be computed as the fair market price of a hold option. In at least one 
embodiment, special player agents 102 are allowed to place such hold options, thus 
providing a guaranteed reservation of a network resource for an arbitrary duration. 

Figs. 6(a) and 6(b) show examples of valuation rules. In Fig. 6(a), a valuation rule is 

1 5 formed of pairs of quantity /price values. In Fig. 6(b), a valuation rule is formed as a 

function of various input variables. These input variables can include any or (but are not 
limited to): the number of hits on a web site ; the average file size downloaded (and the 
bandwidth needed to service those files); the expected delay or expected latency, and the 
value of each hit. Valuation can also be time dependent (e.g., higher valuations are assigned 

20 during peak usage times when more bandwidth is needed) and the network state (e.g., more 
bandwidth is needed if the network is congested). 

It should be noted that there are engineering tradeoffs for the type of valuation rule 
used. Because a low-level valuation rule requires more inputs, which require more time and 
effort to collect and receive, a low-level description may require a large amount of data to 

25 be transferred in order for the agent to be able to bid in accordance with the low-level 

valuation rule. On the other hand, a simple, high-level valuation rule reduces the ability of 
an agent to make an optimum bid because the agent is operating with less information. 
Either implementation can be correct for a given circumstance, depending on the needs of 
the particular player agent and the limitations of its system and network. 

30 Figs. 7(a) and 7(b) shows examples of strategy rules. Fig 7(a) shows a simple rule 

set, where the first precedent is to identify the type of allocation system being used. Once 
the allocation system if identified, the 102 applies a set of predefined conventional rules to 
determine whether it should bid (or bid again). Fig. 7(b) shows an example where the 
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strategy is based on a user-defined function. In the example, if the function reaches a 
threshold value, the agent 102 will bid (or bid again). Other examples of strategy rules 
decide not just whether the agent should bid, but how much the agent should bid, in 
accordance with the allocation rule being used by the system and in accordance with factors 
5 specific to that agent (such as, for example, those factors discussed above in relation to 
valuation rules). 

Certain strategy rules are very simple and involve bidding constant amount. Such 
simple strategy rules may result in uneven amounts of bandwidth being won. Another 
example strategy rule is periodic bidding, in which a buyer agent enters auctions 
10 periodically. 

Fig. 8 shows an example of a flow chart used by an accounting system of an 
embodiment of the invention. The accounting system is notified whenever a resource agent 
closing bidding on a resource unit and keeps track 802 of the winning bids and resulting 
allocations. Periodically, the accounting system bills 804 the seller a percentage of the 

1 5 resources sold, which is received by the owner of the resource agent for operating the 
resource agent and account agents of the Merkato platform. 

Fig. 9(a) is a more detailed block diagram of the embodiment of Fig. 1. It will be 
understood that Fig. 9(a) details only one possible way to implement the present invention 
and that the description herein is not to be taken in a limiting way with regard to operating 

20 systems, protocols, programming languages, etc. The example shown is implemented in . 
Java using the World Wide Web, but the invention is not limited to such a system. In fact, 
the invention can be implemented on any appropriate computers and networks, using any 
appropriate programming language, hardware, software, and operating system. Parts of the 
system can be implemented in software, firmware, or hardware, as needed. 

25 Fig. 9(a) includes four layers: an operating system (OS) layer 902, a Java layer 904, 

a Merkato layer 906, and a diffex layer 108. In the described embodiment, the OS layer 902 
and the Java layer 904 are conventional and will not be described herein. Other 
embodiments may make changes to one or more of layers 902 and 904 to enhance the 
performance of the system, for example. In this example, the use of Java layer 904 makes 

30 the system platform independent. Thus, the Merkato layer 906 can run on any computer or 
computing device (such as a wireless device , pager, cell phone, or Internet appliance) 

Layer 906 allows a player agent 102 to be executed on the client side, from a Java 
application or an applet executing in a Web browser. Alternately (or in addition), a player 
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agent 102 can be executed as a servlet on a web server, which is the "garage" environment 
of Fig. 10. 

Layer 908 enables real-time resource markets between peering ISPs. The layer 908 
allows ISPs to buy and sell resources, such as bandwidth, from each other in real-time. 
5 Layer 908 auctions in real-time the outgoing bandwidth on each ISP's line out of the 

exchange, ensuing that at all times, the bandwidth goes to the buyer with the highest value 
for it. Layer 908 also allows for buying and selling in advance (i.e., making reservations 
with guaranteed capacity and capped prices) through a derivative market of options, in 
effect enabling the trading of risk and hedging, for original ISP buyers and sellers, as well 

10 as for purely financial players. 

Fig. 9(b) shows and example of how a winning buyer ISP 952 is given a resource, 
such as bandwidth. In Fig 9(b), a seller ISP 956 has a seller 902 agent running on a diffex 
client. As discussed above, the seller agent determines that the ISP has bandwidth to sell 
(e.g., through user input via the seller agent GUI) and makes that bandwidth available for 

15 auction by sending a message to resource agent 904 (see numeral 1 in a circle). The buyer 
ISPs 952, 954 have s 902 running on a diffex client. The s bid on the bandwidth in 
accordance with their allocation rules, strategy rules, and valuation rules, as discussed above 
(see numeral 2 in a circle). For certain types of allocation models, the agents also receive 
information about bids during the auction (not shown). Once the auction is concluded, 

20 resource agent 904 sends an allocation command to network control and management agent 
908 (numeral 3 in a circle), which in turn controls a router 970 of the seller ISP so that the 
winning buyer ISP receives its bandwidth (see numeral 4 in a circle). 

Resource agent 104 in layer 908 interfaces with the resource (e.g., with the router 
970 of the seller ISP) to allocate the bandwidth to the winning bidder. Specifically, 

25 resource agent 904 interfaces with control agent 908 to send commands to the router of the 
seller ISPs. If, for example, the resource is bandwidth, allocation to the winning bidder can 
be effected by one of the following or by any other appropriate method: 

a) the router 970 is given a set of class-based weighted-fair queuing 
parameters to be used by the router to control packet queuing in the router so that the 

30 winning ISP buyers are assured of receiving the bandwidth which they was allocated by the 
resource agent. These queuing parameters will give priority to the winning bidders when 
packets from the winning ISPs are sent to the seller ISP's router. 
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b) the router is given a set of committed access-rate parameters to be used by 
the router to limit and shape traffic assure each buyer the bandwidth which it is 
allocated, or 

c) capacity within an MPLS (Multiprotocol Label Switching) tunnel 
5 between two points in the seller's network. 

Thus, as shown in Fig. 9(b), packets 960 from the winning buyer ISP(s) are routed through 
the seller's router and on to the seller ISP's network 956, from which they are delivered to 
their destination. In at least one embodiment, the winning ISPs are aware that they have 
won and use existing routing protocols to ensure that they direct packet traffic to the seller 

10 ISP's router. In other embodiments, the resource agent informs routers in the winning ISPs 
of the needed routing change using known routing protocols. 

In the described embodiment, the resource agent is always executed on a Web 
server. Each resource agent 104 runs as a servlet on a Web server. This architecture is 
scalable because resource agents 104 can be distributed to run on any Web servers 

1 5 supporting the concept of a servlet. 

In the described embodiment, communications between a player agent (either a 
buyer or a seller) 102 and a resource agent 104 are performed via a resource agent proxy 
using any of http extensions, native TCP protocol, or Java's remote method invocation 
(rmi). All communications are secured through an appropriate mechanism such as the 

20 secured socket layer (SSL). A shown in Fig. 9(a), each agent (player and resource) has an 
allocation rule object 128. The player agents 102 also have valuation rule objects 124, 
strategy rule objects, 126, and a GUI object 122. In addition, the resource agents 104 have 
networking and accounting drivers for interfacing with external support systems. Further 
security is provided through the implementation of specific allocation rules that protect the 

25 stability of the system. Specifically, each user is required to pay a bid fee to resource agent 
1 04 for every bid sent. The implementation of a bid fee is intended to prevent users from 
trying to artificially destabilize the resource price and to prevent a "man in the middle" 
attack, which is defined as a third party intercepting a bid from another agent and keeping 
sending the same bid over and over. In addition, timestamps are preferably used to 

30 discriminate every bid. Thus, if a bid has a previously used timestamp, resource agent 104 
will ignore that bid. 

In the described embodiment, communication between agents is effected using http, 
thus bypassing many problems associated with firewalls, proxies, and other middleware. 
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Fig. 10 shows an example of an embodiment of the present invention including a 
"garage" for player agents 102'. The garage 130 is a component on a web server that serves 
the purpose of storing agents and enabling their execution remotely from a user's computer. 
The garage contains an "attendant" program 131 whose job is to help mobile agents find a 
5 parking place in the garage and to ensure proper agent execution when the agents want to 
run autonomously. Garage 130 performs the function of a distributed database to store the 
agents 102' and provides a distributed processor (not shown) to execute the agents. Garage 
130 can be located on the same system as resource agent 104, but can also be located on a 
different machine. 

10 The attendant is an example of a multi-agent. Multi agents coordinate multiple 

simple agents to act together. Multi-agents can be used to form coalitions of agents, using 
the attendant to bid for aggregated resources on behalf of the coalition. In the described 
embodiment, the agents communicate with the attendant to let the attendant know that they 
need resources. The multi-agent aggregates the various types of resources requested (e.g., 

1 5 different connection speeds or different bandwidths) and uses the allocation rule and its own 
strategy and valuation rules to bid on behalf of the agents. If the multi-agent wins, the 
resource is divided between the agents and the attendant so informs the resource agent 104, 
which allocates the bandwidth accordingly. A broker is an example of a multiagent, 
coordinating buyer and seller agents for a common objective to act as a profit-maximizing 

20 reseller. 

In the described embodiment, agent mobility to the garage is provided through 
XML. The syntax and semantics of an agent is described using XML. For example, the 
semantics of an agent can include its allocation, strategy and valuation rules. Each agent 
can be transferred to the "garage" 130 by generating its XML description. Once an agent is 

25 provided in this form, it can be re-instantiated either in a garage 130 or in a user's computer 
in an applet or a java application. 

Figs. 1 l(a)-l 1(b) are an example of an XML file for a generic player agent 102. 
This XML would be used, for example, to send the agent 102 to a client for execution in 
garage 930. Each of the XML tags (indicated by o brackets) identifies an attribute of the 

30 agent that is to be activated in the garage. It will be understood that the XML of Fig. 1 1 is 
shown for purposes of example only and that other embodiments of the invention may use 
more or fewer tags and/or different tags than those shown in Fig. 1 1 . 
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Figs. 12(a)-12(c) shows an example of Java code implementing a strategy rule for a 
102. This example implements a strategy in accordance with the PSP auction model 
described above. As shown in section 1234 of Fig. 12(c), an agent using this strategy will 
submit a new bid only if the new bid determined in the rule is increased by at least a 
5 calculated value epsilon. Note that this strategy rule calls a valuation rule in line 1232 of 
Fig. 12(c). This valuation rule implements a PSP valuation method. 

Figs. 13(a)-13(c) show an example of Java code implementing an allocation rule for 
a resource agent 104. This example looks at a number of received bids in a bidlist data 
structure and computes an allocation (similar to that of Fig. 5) given the current bids on the 

10 bid list in accordance with a PSP auction allocation model. 

Fig. 14 is an example of an XML file for a generic resource agent 104. This XML 
would be used, for example, to send the resource agent 104 to a web server. Each of the 
XML tags (indicated by o brackets) identifies an attribute of the agent that is to be 
activated on the server side. 

15 Figs. 15(a)-15(r) show examples of user interfaces for buyer and seller agents. Figs. 

15(a) and 15(b) show an example of an html GUIs that provides static information to a user. 
Figs. 15(c) and 15(d) show an example of GUIs implemented as a Java applet. The 
information provided by these interfaces is continuously updated in real-time or 
periodically. Figs. 15(e)-15(r) show an example of an advanced GUI, which is preferably 

20 also implemented as an agent. Particular buyer and seller agents may have one, none, or all 
of the particular GUIs shown here, or may have GUIs providing other relevant information 
not shown here. Note that several of these GUIs allow a human user to choose the valuation 
and/or strategy rules and to determine when and how to bid. It should be understood that in 
the preferred embodiment, buyer agents are capable of bidding on their own. Other 

25 embodiments may contain agents that bid only at the direction human beings. 

Accordingly, the present invention is intended to embrace all such alternatives, 
modifications and variations as fall within the spirit and scope of the appended claims and 
equivalents. 



16 

SUBSTITUTE SHEET (RULE 26) 



WO 01/88811 



PCT/US01/15424 



APPENDIX A 
Docket No.: 61624-04980 

Design and .Analysis of the Progressive Second 
Price Auction for Network Bandwidth Sharing* 

Aurel A. Lazar 
Dept. of Electrical Engineering, Columbia University 
New York, NY, 10027-6699, USA 
aureKDcomet . Columbia, edu 
and 

Nemo Sennet* 

Dept. of Electrical Engineering, University of California, Los Angeles 
Los Angeles, CA, 90095-1594, USA 
nemoQee . ucla . edu 

April 1, 1998 
Revised: September 30, 1999 



Abstract 

We present the Progressive Second Price auction, a new decen- 
tralized mechanism for allocation of variable-size shares of a resource 
among multiple users. Unlike most mechanisms in the economics lit- 
terature, PSP is designed with a very small message space, making 
it suitable for real-time market pricing of communication bandwidth. 
Under elastic demand, the PSP auction is incentive compatible and 
stable, in that it has a "truthful" f-Nash equilibrium where all players 
bid at prices equal to their marginal valuation of the resource. PSP 
is economically efficient in that the equilibrium allocation maximizes 
total user value. With simulations using a protype implementation 
of the auction game on the Internet, we investigate how convergence 
times scale with the number of bidders, as well as the trade-off between 
engineering and economic efficiency. We also provide a rate-distortion 

•Pans of this work were presented at the 8 th International Symposium on Dynamic 
Games and Applications, Maastricht, The Netherlands, July 1998, and at the DIM ACS 
Workshop on Economics. Game Theory, and the Internet, Rutgers, NJ, April 1997. 

'Corresponding author. 
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theoretic basis for valuation of bandwidth, which leads naturally to the 
elastic demand model that is assumed in the analysis of the mechanism. 

Keywords: resource allocation, auctions, game theory, mechanism 
design, network pricing. 

1 Introduction 

Communication networks axe characterized by what economists call exter- 
nalities. The value a user gets from the network depends on the other 
users The positive externalities are that a communication network is more 
valuable if more people are connected. The negative externalities are that 
resources are shared by users who - because of distance, population size, or 
selfishness - cannot or will not coordinate their actions sufficiently to achieve 
the most desirable allocation of resources. The recognition of this reahty m 
many aspects of networks and distributed computations has lead in recent 
years to the emergence of game theoretic approaches in their analysis and 

desien f23, 9, 25, 33, 15, 16]. , „. 

Prices, whether they relate to "real money" in a public network or "funny 
money" (based on quotas) in a private system, play a key role as allocation 
control signals. In the former case, this role is of course intimately tied to 
another, which is to allow a network provider to remain: in business 17J. 

The telephone system and the current Internet represent two extremes 
of the relationship between resource allocation and pricing. The resources 
allocated to a telephone call are fixed, and usage prices are based on the 
predictabibty of the total demand at any given time. On the Internet, the 
current practice of pricing by the maximum capacity of the user s connection 
(flat-rate pricing) decouples the allocation (actual use) of resources from the 

PnC S'the emerging multiservice networks (ATM, Next-Generation Inter- 
net), neither of these approaches are viable. The former because of the wide 
and rapidly evolving range of applications (including some which adapt ^to 
resource availability) will make demand more difficult to predict. And the 
latter because, once the flat fee is paid, there are no incentives to limit usage 
since increasing consumption benefits the user individually, whereas limiting 
it to sustainable levels brings benefits which are shared by aJL This makes 
it vulnerable to the well-known "tragedy of the commons". With flat pric- 
ing alone, the tendency is toward increasing congestion which chases away 
high-value users, or increasing prices which exclude low-value users [6]. in 
both cases leading to decreased network revenue. 
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Thus there is a need to develop new approaches to pricing of network 
resources. Among the requirements are: sensitivity to the range of resource 
requirements (either through a sufficiently rich range of traffic classes which 
are priced differently, or by allowing users to explicitly quantify resource 
requirements); prices must be dynamically responsive to unpredictable de- 
mand (market based system); perhaps most importantly the pricing archi- 
tecture should constrain as little as possible the efficiency trade-offs of the 

^mSed^the fundamental issue in designing pricing policies is the trade- 
off between engineering efficiency and economic efficiency. This trade-off, 
which is more or less constrained by the underlying network technology, has 
many dimensions, including: 

• how much measurement (from usage to capacity pricing), 

• the granularity of differently priced service offerings (e.g. number of 
traffic classes), 

• the level of resource aggregation - both in time and in space - at which 
pricing is done (per packet/cell or per connection, at the edge of the 
network or at each hop), and 

• the information requirement (how much a priori knowledge of user 
behavior and preferences is required/assumed by the network in com- 
puting prices). 

An approach which achieves economic efficiency is the smart-market ap- 
proach of [19], wherein each packet contains a bid, and if it is served, pays 
a clearing price given by the highest bid among packets which are denied 
service (dropped). This approach is incentive compatible in that the optimal 
strategy for a (selfish) user is to set the bid price in each packet equal to 
the true valuation. Each node in the network becomes an efficient market, 
but the engineering cost (sorting packets by bid price, as well as per-packet 
and per-hop accounting) could be significant if line speeds are high relative 
to the processing power in the router. In [14], users are charged according 
to a combination of declared and measured characteristics of traffic By 
taking an equivalent bandwidth model of resource utilization, and assuming 
appropriate traffic models, a menu of pricing plans indexed by the declared 
traffic can be offered which encourages users to make truthful declarations 
(e g of the mean rate), and also encourages the users' characterization ef- 
forts to be directed where they are most relevant to the network resource 
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allocation. As the pricing is relative, [14] does not aim to address the prob- 
lem of determining the actual monetary values of the market price (that 
users would be willing to pay). Another pricing scheme wluch incorpo- 
rates multiplexing gain is formulated in [12]. These and a number of other 
schemes are summarized in [11], in a comprehensive view of the connection 
establishment process, which identifies the user-network negotiation as the 
kev "missing link" in network engineering/economic research. In terms ol 
our taxonomy of the previous paragraph, this is part of the informauon re- 
quirement trade-off Indeed, in the absence of formal mechanisms to deal 
S the information problem, complex and (at least mtmtivdy) undesirable 
things happen. For example, some providers offer expensive "front of the 
3 rates to uninformed customers, and lower "back of the book" rates to 
informed customers who may be about to defect to another earner (s^ [7] 
and also, the recent wars between AT&T and MCI in consumer long-distance 
service in the United States). In [34], it is argued that architectural consid- 
erations such as where charges are assessed should take precedence over the 
pursuit of optimal efficiency, and edge pricing (spatial aggregation in terms 
of our taxonomy) is proposed as a useful paradigm. 

In this paper, we propose anew auction mechanism which accommodates 
various dimensions of the engineering-economics trade-off. The mechanism 
applies to a generic arbitrarily divisible and additive resource model (which 
may be equivalent bandwidth, peak rate, contract regions, etc., at any level 
of aggregation.) It does not assume any specific mapping of resource allo- 
catiSTto quality of service. Rather, users are defined as having an explicit 
monetary valuation of quantities of resource, which the network doesn t or 
can't know a priori. Thus, in terms of our tradeoff taxonomy, this mech- 
anism aims for unlimited granularity, flexibility in the level of aggregation 
and minimal information requirement. 

In the most likely auction scenaria, users would be aggregates of many 
flows data flows for which bulk capacity is being purchased for e.g. Virtual 
Paths, Virtual Private Networks, or edge capacity (2, 31]. 

We begin in Section 2 by formally presenting the design of our Progres- 
. sive Second Price auction mechanism for sharing asingle arbitrarily divisible 
resource, and relating it to classical mechanism design from the economics 
litterature. In Section 3, after describing our model of user preferences 
and the elastic demand assumption, we prove that PSP has the desired 
properties of incentive compatibility, stability, and efficiency. The section 
concludes with simulation results on the convergence properties, and the 
efficiency trade-offs. Appendix A describes an information theoretic basis 
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for valuations of the type that are assumed in the analysis of Sections 3, as 
one possible justification. 

2 Design of an Auction for a Divisible Resource 
2,1 Message Process 

Following [36], it is useful to expose the design in terms of its two aspects: 
realization, where a message process that enables a certain allocation ob- 
jective is defined; and Nash implementation, where allocation rules are de- 
signed with incentives which drive the players to an equilibrium where the 
(designers) desired allocation is achieved. 

In this section we define the message process. Here we make the fun- 
damental choice which will constrain the subsequent aspects of the design. 
Our first concern here is with engineering. For the sake of scalability in a 
network setting, we shall aim for a process where a) the exchanged messages 
are as small as possible, while still conveying enough information to allow 
resource allocation and pricing to be performed without any a- priori knowl- 
edge of demand (market research, etc.); and b) the amount of computation 
at the center is minimized. 

Given a quantity Q of a resource, and a set of players X = {1, . ..,/}, 
an auction is a mechanism consisting of: 1) players submitting bids, i.e. 
declaring their desired share of the total resource and a price they are willing 
to pay for it, and 2) the auctioneer allocating shares of the resource to the 
players based on their bids. 

Player Vs bid is Si = (qi f pi) € Si = [0, Q] x {0, oo), meaning he would like 
a quantity at a unit price p t -« A bid profile is s = (<si,- . ••£/). Following 
standard game theoretic notation, let 5.,- = 1,5,*+],. ..,5/), i.e. 

the bid profile of player i'a opponents, obtained from s by deleting 5,-. When 
we wish to emphasize a dependence on a particular player's bid 5,-, we will 
write the profile s as (5i;s_ t ). 

The allocation is done by an allocation rule A, 

A: S S 

s = (q,p) = («(*)> *(*)), 

where S = liter 3f • 

The i-lh row of A(s), Ai(s) = (a,-(.s),c t («s)), is the allocation to player 
i: she gets a quantity Oi(s) for which she is charged c*(j). Xote that p is a 
price per unit and c is a total cost. 
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An allocation rule A is feasible if Vs, 



and Vi € 2, 



Ci(s) 



Remark a: The above formulation is a generalization of what is usually 
meant by an auction. The latter is the special case where a w (s) = Q for 
some winner w € I and 0,(5) = 0, Vt 5* i.e. the sale of a single indivisible 
object to one buyer, for which the theory is well developed [22, 24]. In our 
approach, allocations are for arbitrary shares of the total available quantity 
of resource. Equivalently, one could slice the resource into many small units, 
each of which is auctioned as ah indivisible object- But in a practical imple- 
mentation of auctions for sharing a resource, a process of bidding for each 
individual unit would result in a tremendous signaling overhead. More im- 
portantly, since the users would be bidding on a discrete grid of quantities, 
analytical predictions of outcomes could be misleading since they could be 
sensitive to the particular choice of grid 1 . 

Remark b: Most of the mechanism design literature in Economics 
makes use of the following "Revelation Principle": 

Given any feasible auction mechanism, there exists an equivalent 2 
feasible direct revelation mechanism which gives to the seller and 
all bidders the same expected utilities as the given mechanism. 
([24], Lemma 1) 

In this sharing context, a direct revelation mechanism would be one where 
each user message consists of the user's type, which is the valuation 3 of the 
resource over the whole range of their possible demands, i.e. a function 
$i : [0,Q] [0,oo), and the budget (see Section 3.1). A consequence of 
revelation principle is that the mechanism designer can restrict her atten- 
tion to direct revelation mechanisms, find the best mechanism in terms of 

2 For a more detailed discussion of this point, see [4] p. 34, and references therein. 

2 By equivalent, in [24] it is meant that, at some equilibrium, all players get the same 
utility. There may be other, possibly ill-behaved, equilibria. 

3 The valuation of a given amount of resource is how much the user is willing to pay for 
that quantity. The inverse of the valuation is the user's demand function, giving a desired 
quantity for each price. 
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her (economic) efficiency objectives, and then - if necessary - transform it 
into an equivalent mechanism in the desired message space. This is conve- 
nient because one can exclude the infinitely many mechanisms with larger 
message spaces, without fear of missing any better designs. The design pro- 
cess is usually the solution of an optimization (mathematical programming) 
problem. For this reason, in the litterature, mechanism design problems are 
mostly solved for cases where the space of users' types is one dimensional, 
or at most finite dimensional [21], using message spaces that are of the same 
dimension. 

In our sharing problem, the conventional approach is unsatisfactory in 
two ways: 

# First, a user's type is infinite-dimensional, as we do not restrict the 
valuation functions beyond some very general assumptions (see Sec- 
tion 3.1), and so the conventional "programming" approach of de- 
riving the mechanism from the revelation principle would lead to an 
intractable problem. 

• Second, the conventional (direct revelation) approach, even if it was 
tractable, implies that a single message (bid) can theoretically be in- 
finitely long, because it has to contain a description of the function 0 t *. 
Clearly, this is not desirable in a communication network, where sig- 
naling load is a key consideration. For engineering reasons, we choose 
a message space that is 2- dimensional. Therefore, a given message can 
come from many possible types, so there is no single way to do the 
transformation from the direct revelation mechanism to the desired 
one. 

Thus, unlike most of the mechanism design literature, we will take a di- 
rect approach, where we posit an allocation rule for our desired message 
space, and then show that it has an equilibrium, and that the design objec- 
tive is met at equilibrium 4 . This is equivalent to guessing the right direct- 
revelation- to-desired-mechanism transformation and building it into the al- 
location rule from the start. 

4 Our aim is to to show that we con use this smaller message space and still achieve our 
objective. 
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Figure 1: Exclusion-compensation principle: the intuition behind the PSP 
rule 



2.2 Allocation Rule 

Define, for y > 0 



£,.(y,*-i)= \q- £ »] 



(1) 



and _ + 

The "progressive second price" (PSP) allocation rule is defined as follows: 



o,(s) = g; AQ i (pi,s_ 1 ), 



(2) 
(3) 



where A means taking the minimum. 

Remark a: For a fixed opponent profile Qi(jH,s-i) represents the 
maximum available quantity at a bid price of p<. The intuition behind 
PSP is an exclusion-compensation principle: player i pays for his allocation 



24 

SUBSTITUTE SHEET (RULE 26) 



WO 01/88811 



PCT/US01/15424 



so as to exactly cover The "social opportunity cost" which is given by the de- 
clared willingness to pay (bids) of the users who are excluded by t's presence 
(see Figure 1), and thus also compensates the seller for the maximum lost 
potential revenue. Note that this amounts to implicitly assuming that the 
bid price accurately reflects the marginal valuation 9\ on the range [o^gj. 
In other words, by this rule the auctioneer is saying to the player: "if you 
bid (qi 9 pi)j I take it to mean that in the vicinity of 0» can be approxi- 
mated by a line of slope p,-. 77 This is the (built-in) transformation from the 
direct-revelation mechanism to the desired message process discussed in the 
second remark at the end of Section 2.1. 

The charge a increases with a Y - in a manner similar to the income tax in 
a progressive tax system. For a fixed opponent profile s_,-, imagine player i 
is increasing starting from 0. The first few units that player i gets will be 
taken away from the lowest clearing opponent (i.e. m = argmin,{p/ : aj > 
0}), and player t will pay a price (marginal cost) pm per unit. When am 
reaches 0, the subsequent units that player i gets will cost him p m / > p m , 
where m f is the new lowest clearing player, the one just above m. The 
PSP rule is the natural generalization of second-price auctions (or Vickrey 
auctions). In a Vickrey auction of a single non-dixrisible object, each player 
submits a sealed hid. and the object is sold to the highest bidder at the bid 
price of the second highest bidder, which is what happens here if q< = Q> Vi. 
This is widely known ro have many desirable properties [35, 24, 4], the most 
important of which is that it has an equilibrium profile where all players bid 
their true valuation. As we will presently show, this property is preserved 
by the PSP rule in the more general case of sharing an arbitrarily divisible 
resource, and this leads to stability (Nash equilibrium). The PSP rule is 
analogous to Clarke- Groves mechanisms [3, 8, 20] in the direct-revelation 
case. 

Remark b: When two players bid at exactly the same price, and they 
are asking for more than is available at that price, (2) punishes both of them. 
For example, if Q = 100 and Sj = (4,60) and s 2 = (4,70), the allocations 
would be a t =r 60 A ( 100 - 70) == 30, and a 2 = 70 A (100 - 60) = 40. Since 
the bid prices are equal, there is no "right 77 way to decide who to give the 
remaining capacity to. One could divide it equally, or proportionally to their 
requests, etc. For the subsequent analysis, it turns out it is simpler to not 
give it to either one (of course, it will be allocated to the lower bidders if 
there are any). This is just a technicality since by deciding this, we ensure 
that it will never happen (at equilibrium), since the users will always prefer 
to change their prices and/or reduce their quantity. 
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Considering the computational complexity of PSP, a straightforward im- 
plementation would at worst, sort the bids in time /log/, perform (2) in 
linear time, and (3) can be done in time Z 2 . Tins, the complexity of com- 
puting the allocations is 0{P). 

3 Analysis of the Progressive Second Price Auc- 
tion 

3.1 User Preferences 

Since the allocation rule A is given by design, the only analytical assump- 
tions we make is on the form of the players 9 preferences. 
Player Vs preferences are given by his utility function 

n,- : S — ► (-00,00) 
s 1 — ► tt»(a). 

Player j has a valuation of the resource 0i(ai(s)) > 0, which is the total 
value to her of her allocation. Thus, for a bid profile of j, under allocation 
rule A r player i getting, an allocation A{(s) has the quasi-linear utility 

= 4(oi(j)) - Ci(s) (4) 

which is simply the value of what she gets minus the cost. 

In addition, the player can be constrained by a budget 6 t - € [0, 00]. so 
the bid s f - must lie in the set 

5 t (s_*) = {Si € Si : Ci(s { ; < (5) 

In the proofs of the following section, we will assume that users have 
elastic demand, that is: 

Assumption 1 For any i € Z, 

• 0i(O) = 0. 

• Bi is difffirntiable, 

• > 0, non-increasing and continuous 

• 37,. > 0. Vr > 0, 6\{z) > 0 => V17 < z y 0<(z) < 6\(n) - 7i (z - 17). 
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The last item says that as long as the valuation is strictly increasing, it must 
also be strictly concave (with minimum curvature 7,). However, it is allowed 
to "flatten" beyond a certain amount of resource. 

Functions of this (concave) form have wide applicability as models of 
resource valuation, and can be justified from the economic standpoint (di- 
minishing returns) as well as from information theoretic standpoint - see 
Appendix A. For examples of valuations satisfying Assumption 1, see Sec- 
tion 3.4 and Appendix A. 

3.2 Equilibrium of PSP 

The auction game is given by ((?,tfi,...,u/,-rl), that is, by specifying the 
resource, the players, and a feasible allocation rule. We analyze it as a 
strategic game of complete information [4]. 

Define the set of best replies to a profile of opponents bids: S*(s-j) — 
{si 6 Si(s-i) : Ui(si;s-i) > Vj$ 6 £*(«-*)}. Let S'(s) = Ui $i (*-*). 

A Nash equilibrium is a fixed point of the point-to-set mapping 5% i.e. a 
profile 3 € S m (s), In other words, it is a point from which no player will 
want to unilaterally deviate. Such a point is what is most accepted as a 
consistent prediction of the actual outcome of a game, and has been re- 
peatedly confirmed by experiments, as well as a wide range of theoretical 
approaches. Indeed, in a dynamic game, where players recompute the best 
response to the current strategy profile of their opponents, this iteration 
can only converge to a Nash equilibrium (if it converges at all). In addition, 
an important trend in modern game theory is the development of learning 
models, and there too, it has been shown that Nash equilibria result also 
from rational learning through repeated play among the same players [13]. 

A more general (and hence weaker) notion of stability is the existence 
of an €-Nash equilibrium. Let the e-best replies be Sf($~i) = {s< € Si(s_ t ) : 
Vi(si;s~i) > ui(s<;s~i) - e,Vs< € Si(s-i)}* An «-Nash equilibrium is a fixed 
point of S € . 

In a dynamic auction game, e > 0 can be interpreted as a bid fee paid 
by a bidder each time they submit a bid. Thus, the user will send a best 
reply bid as long as it improves her current utility by €, and the game can 
only end at an f-Xash equilibrium. 

Define 

/>•(*, s_ f -J = inf {y > 0 : <?,(y, > z} . (6) 
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Thus, for fixed s-i, Vy,z > 0, 

z < Qi(y, *-0 =>y> *-0 W 

y> Pi{z, s-i) =>z< Qi{y,s-i)- ( 8 ) 
The graph of Pi(.,s-i) is the "staircase" shown in Figure 1, and that of 
Qi(.,s-i) is obtained by flipping it 90 degrees. 
It is readily- apparent that 

<,(*) = r M Piiz^dz. 0) 
Jo 

The key property of PSP is that, for a given opponent profile, a player 
cannot do much better than simply tell the truth, which in this settmg means 
bidding at a price equal to the marginal valuation, i.e. set pi = «K«J. By 
doing so, she can always get within e> 0 of the best utility. 

Let % = {Si € Si : Pi = «K«)}, ^e (unconstrained) set of player t s 
truthful bids, and T = H r^i- 

Proposition 1 /Incentive compatibility^ Under /Issumpiton J, Vt € I, 
Vs., € S-i, such that Qi(0,s-i) = 0, for any € > 0, there exists a truthful 
(-best reply r,(s_ f ) € % n Sfts-.)- 
/n particular, let 

Gi(s.i) = {z6[0,Q] : :< QM*),s-i) and £ Pi(n,s-i) dn < 6,j . 
Then with Vi = [rap <?.•(«-<) - c/^(0)] + <md = «<(i> f ), = (*.«») 6" 

7-nS/(s_,). 

The truthful best reply can be found in a straightforward manner, as 
illustrated in Figure 2. 

Proof: Fix s_i € 5— Let z, = sup <?<(«-<) and y, = J- 

By definition of 3{*(n)} C G<(*-.) such that lim„*(n) = «. Hence 6, > 
lim. r (n) Pdn.s-i)dv = /''Pi^s-Odn > c,(ti;s-i), where the equality come 
ta the boundiness of pfand She Lebesgue dominated convergence theorem, and 
the second inequality from (9) and (2). Thus t, 6 TnS,(«-<)- 

s Actually, since <?.(., s-0 is upper-semi-continuoiis (j"™!* «P)» "* have 2 - 
Qi(tf.'-i) - * 2 Pii','-) 



28 

SUBSTITUTE SHEET (RULE 26) 



WO 01/88811 



PCTAJS01/15424 




Q 



Figure 2: Truthful c-best reply 



Next we show that U € 5/(*-0- First, Zi = limn *(n) < l™n Q< W ( r ( n ))> < 
0,'(Jim„ *{(*(*)), *-*')• where the inequalities follow respectively from r(n) € <?<(«-* )i 
and the upper semi-continuity of Qi(. 9 s-i). Now by the continuity of F Q» (lim„ 0 g '(*(n)), s_.) = 
Qi(8l(zi),s-i) = Qifcfc,*-.), hence 



Now, we claim that tuft; s_.) = t*. Indeed, if 0 then v B - = 0 and Qi(U; = 0. 
If zi > 0, then by (10), Qi(y if s-i) > 0 and since by hypothesis Qi(0,*-0 = 0, we 
have eiizt) = y, > 0. Also, Zi > 0 implies ti* < z t . Therefore, by Assumption 1, 
we have Wi = 0J(v#) > 0-(xi) = j«. Hence, since Q.(.,a-») is non-decreasing, 

Q.(m a #-<) > iim,N»3 f (ir.*-<) = Q<(w»*-<) > * > «*• Thus > b y < 2 >t 



(10) 



(ID 



Now Vsi e 5f(«-t), 
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> [" ( ' ) [P i (z,s. i )-9' i (z)]dz- ( (12) 

where the inequality follows from (r< - v s ) < e/B^O) and the fact that <?,< is non- 
increasing- Thus, it suffices to show that the integral is > 0. 

If zi < a,(s), take any z € (*,o<(«)]. By the definition of r,-, z g G,(s-.). Now 
5i 6 S,(*- 4 ) implies 6,- > a(s) = .-,) eft, > £ Therefore 

we must have z > (?,(*<(*)), which by (8), implies O^z) < Pi(z) and the mtegrand 
in (12) is > 0 as desired. 

Suppose zi > ai(si). Since ^ is non-increasing, Qi(.,*-0 is non-decreasing and 
Pi(.,s-i) > 0, any point to the left of Zi is in the set Gi(s-i), Vz < € (?*($-<)« 
hence z < Qf(4(j) t *-0 which by (7), implies «(*) > *(*,*-.*), *> integrand 
in (12) is < 0 as desired. D 

Figure 3 shows the utility function of player 4, u A (s 4 ), in a PSP auction- 
with / = 5 players, with *_4 fixed, and a valuation 0 4 (q) = 10g. The plateaus 
correspond to the points where q 4 > Q 4 (p 4j s) = [Q - £{j>Pi>i*t} a i( 5 )] » 
and a 4 (s) can no longer be increased at that bid price - see (2). At bid prices 
p 4 > ps, the utility decreases when a 4 > Q - 9s, because after that point, 
each additional unit of resource is taken away from player 5, and thus costs 
ps 9 which is more than ff A its value to player t. Thus, each additional unit 
starts bringing negative utility. This is what discourages users from bidding 
above their valuation. Proposition 1 is illustrated by the fact that for any 
given quantity q 4 , the utility u 4 is maximized on the plane p 4 = ff 4 s 10. 

Remark: When the players have linear valuations and no budget con- 
straint (6; = oo). PSP becomes identical to a second-price auction for a 
non-divisible object. Then the existence of a Nash equilibrium follows di- 
rectly from incentive compatibility. 

Note that in PSP, the incentive compatibility (optimality of truth-telling) 
is in the price dimension, for a given quantity. With the message space we 
Have designed, there is no single "true" quantity to declare, the optimal 
quantity depends on opponent bid prices. Were the message process such 
that players declared a price and a budget (rather than desired quantity), 
it may have been possible to design an allocation rule A such that they are 
inclined to reveal their true budget, thus obtaining incentive compatibility in 
both dimensions- and hence equilibrium. But such a rule A would likely not 
have a simple closed form like (2) and (3). In essence, the computational load 
of translating budgets into, shares would be centralized at the auctioneer, 
thus making the system less scalable to large numbers of users. On the 
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Figure 3: Utility u 4 (s 4 ) for s t = (100, 1), s 2 = (10,2), * 3 = (20,4), s s = 
(20 ? 7),* 6 = (30,12) 

other hand, decentralization has a cost too, which is the signaling overhead 
resulting from players possibly adjusting bids based on opponent bids in the 
iterated game. Our design is based on the premise that the latter approach 
is the more scalable of the two (indeed that was the reason for choosing a 
small message space). 

The next property is that the truthful best reply is continuous in op- 
ponent profiles (this can be seen in Figure 2: as the "staircase 77 is varied 
smoothly, the point of intersection with 0* moves smoothly, provided 6$ is 
not flat - which is given by the last time item in Assumption 1). To prove 
that, we will need the following: 

Lemma 1 Vs,*' 6 5,Vy,2> 0,W > 0, if - sL { \\ < 6 then 

Qi(y + 6. s-i ) + 6Vl> Qi{y, *'_,-) > Qi(v - * ) - *vT f (13) 

and ^ ^ Sy/I.s-i) + S> Pi(z,sLi) > ~ - «- (14) 

Proof: First, ||s. f -V f I| < 6 implies 1^-9*1 < and p k +6 > p> k > p k S. 
Thus, Z k qkli Pk +6>y) +<V7 > E*^V fc >v> * Ht** l <P>-*>v) Then ' 
using (1) and the identity (a + 6)+ < (a)+ + (6)+, the first result follows. 
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For any y < P,(z, sL { ), by (7), we have z > <?.(y, si,) > Qi(y - 6,*-,) - 6y/7, 
which by (8), implies y - 6 < Pi(z + 6>/j,s_ f ). Letting y / -Pi(z,si £ )> we get 

For any y > Pi(z, 51,), by (8), we have r < Q,(y, si,) < Q.(y + *-<) + «V7, 
which by (7) implies y + 6 > Pi{z - 6\/7). Letting y \ Pi(z,*Li), we get 
*(*,*!.•) >iH*-^)-*- ° 

Lemma 2 (Continuity of best reply) Under Assumption 1, Vi € I, tAc e-tesi 
rep/y *,* ^iven in Proposition 1 is continuous in s—i on any subset V{(P y ~P) = 
{s-i € Si : Vz > 0,P > P;(z,5. f ) > £}, uriiA oo > F > £ > 0. 

Proof: Let Zi = supG t -(s-f). We will show z+ is continuous, and the continuity of 
v . — [z< — e/0<(O)] + and Wi = 0$(v.) follow immediately (recall that by Assumption 1, 
6i is continuous). 

Suppose there is a discontinuity at some s~i. Then, 3f j > 0, such that V£ > 0, 
3*L tf € Vi(£, P) with ||5_i - 5l,H < * and |fr - z*|| > c lt where zj = sup Gf(4J. 

Suppose Zi + €i<z< s= supGf(*L,-) (the case zj + *i < z f is handled identically, 
with s-i and il a - interchanged. ). Consider the definition of G%\ since €\ is decreas- 
ing and Q<(.,s_ t ) is non-decreasing and Pi(.,s-i) > 0, any point to the left of zj is 
in the set G,(sl_ s ), therefore 



Also, by (7) zi + c x < QdBfci + € i ), s»_i) => *J(*. + * i ) > + <i , Now 

since ^L; e V5(,P, P), this last expression is > P > 0, hence + «i) > 0. Then 

using Assumption 1, + c 2 ) > *<(*•■ + *i) + 7i(«i - «*) > + «i) + *i for 

S < *i = J- A € t 7f, and 0 < e 2 < (d - S X VI) A (Ci - <$i/7i)- Therefore, since 
Qi(.,s-i) is non-decreasing, 



Thus, z, + ci < <2i(*<(z, + ei). jL,) = [O-E* 9tl<Pt>*i(-+<i)}] + - Therefore, 
Zi + ci < + *i) + *-*) + 



(15) 



using Lemma 1. 



Zi + c 2 < + *a), s_i) + * - ci + «2 

< G.-(4(*+'3)>*-<)- 



(16) 



Now (15) also implies that 
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and this holds Ve 3 < ci- Now, using Lemma 1, 
[/*(*. *L,) -Piil, *-<)] *I 

> -6Q+[' Pi(ji-£Vl,s.i)d v - P^s-^dr) 

Jo Jo 

> -(Q + P)*Vl. 

Let < 2 = (5^7?» 311,1 €3 ^ that <>< e 3 < - (0 +?)fcV?/£- 

Jo 

fot Now dioosing 6<6i*62, (16) and (17) imply that Gi{s-i) B (*-H 3 )A(*.-K 3 ) > 
r,- = sup d(s-i), a contradiction. 

We introduce one additional player, player 0, whose valuation is 0 0 (z) = 
vaz, and whose bid can therefore be fixed at s 0 = (go.Po) = WiW 
Ocan be viewed as the auctioneer, and po > 0 as a Reserve price at which 
the seller is willing to "buy" all of the resource from himself. Prom (1), the 
presence of the bid s Q = (<?,Po) implies V* € Z,Q<(f ,«-*)*' 0,V» < Po- 1» 
particular, setting y = 0, the condition of Proposition 1 holds. Thus .we can 
restrict our attention to truthful strategies only, and still have feasible best 
replies. This forms a -truthful" game embedded within the larger auction 
game, where the strategy space is 7" C «S, the feasible sets are % n S^-i), 
and the best replies are *?(*) = * n *K">- A ^ a T . 18 a 

fixed point of 5' in 5. Thus an equilibrium of the embedded game is an 

equilibrium of the whole game. 

Proposition 2 (Nash equilibrium) In the auction game vnth * Ae /f^™J e 
given by (2) and (S) and a reserve price po > 0, and players described by 1(4. ) 
and (5), if Assumption J holds, then for any € > 0, there exists a truthful 
e-Nash equilibrium s" £ T. 

Proof: V* € T, Vi 6 I. V? > 0, we have z > 0 = Qiipo/2,s-i), which by (8) 
implies ft(z, s-0 > po/2 = Let P = max t€lu < 0} *i(0). Then 'he conditions 
of Lemma 2 are satisfied and * = (v, w) is continuous in s on T. By Assump- 
tion 1, % is continuous therefore v(g,p) = v(,,fr"( 9 )) (as defined in Proposition 1), 
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can be viewed as a continuous mapping of [Q t Q] f onto itself. By'Brouwer's fixed- 
point theorem (see for example [10]), any continuous mapping of a convex compact 
set into itself has at least one fixed point, i.e. 3q m = v(q*) € [0,<3] 7 . Now with 
s* ss (<Ti W)), we have s* = i(s*) €T. ° 



3.3 Efficiency 

The objective in designing the auction is that, at equilbrium, resources al- 
ways go to those who value them most. Indeed, the PSP mechanism does 
have that property. This can he loosely argued as follows: for each player, 
the marginal valuation is never greater than the bid price of any opponent 
who is getting a non-zero allocation. Thus, whenever there is a player j 
whose marginal valuation is less than player i's and j is getting a non- 
zero allocation, i can take some away from j, paying a price less than t's 
marginal valuation, i.e. increasing t*i, but also increasing the total value, 
since t's marginal value is greater. Thus at equilibrium, i.e. when no one 
can unilaterally increase their utility, the total value is maximized. Formally, 
consider a € argma3U £i*i(*0- The Karush-Kuhn-Tucker [17] optimality 
conditions are that there exists a Lagrange multiplier A such that 0£(oO = A, 
if ai > 0, and ^(0) < A, if <n = 0. 

Assumption 2 For any i € X, ft/= oo, and B\ satisfied* 

flW-tiCO >-*(*- A 

whenever z > z' > 0. 

Given any e > 0, for any 6-Nash«equilbrium s m € T, let a* s and 
let a* = nunf 6 3u{o}A->o a i i the smallest non-zero allocation. The following 
is the tt € version* of the Karush-Kuhn- Tucker conditions. 

Lemma 3 Suppose Assumptions 1 and 2 hold. If for some j, a] > \/€/k, 
thenVielUiQ}, 

An immediate corollary is that ifa m > y/e/K then 
A- - 2jeZ < flj(*?) < A* + 
6 If B\ is differeniiable, the condition is 0 > tff > -x . 
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if a m > y/ejn and 

0<(a?) < A* 4- 2Vi£, 

if a* = 0, for some A* > 0. 

Proof: Suppose *{(*?) > ^(aj) + 2V^- Since *J(aJ) > 6'^) > pj, we have 

Since a? > 0, if player i bids at a price above pj, he can take all of player j's 
allocation, without losing anything of his own, i.e. a* + a) < Qi(pJ,$L,). By (7), 
this implies 

Pj >«(a? + 
Let g,- = (a? + y/T/n) and Si = (ft, Then 

> ['.<<«?) - *v^A - pj] 



which contradicts the fact that 5* is an e-Nash equilibrium. D 

Proposition 3 (Efficiency) Suppose Assumptions 1 and 2 hold. If a* > 
y/Tfk, then _ 

where A = {aG [0, <?] /+1 : < hi- 

proof: (of Proposition 3) Let 1+ = \k : a k > aj} and X~ = {* : o fc < aj}. For 
i 6 T+, we have ^(aj) < A" + 2v"«. For i € X", we have oj > a* > 0, therefore 
by the lemma, fl}(aj) > A* - 2V?*. Therefore, 

5><*j - *« ) < Z^( a ?)( a » - a <) ~ 

< (A* + 2v^T) A - (A* - 2v/^)A, 

where A = £1* (a* - a i ? ) = £x-( a ? "~ Since A < Q the result follows, with 
the bound 4Qy/oc. ° 
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Remark a: The condition 6 t - = oo is sufficient, but not necessary to 
achieve efficient outcomes. In fact with any budget profile, efficiency can 
be achieved if the users cooperate. For example, if they all choose a bid 
quantity close to what they can actually obtain (which they do if they use 
the strategy given by Proposition 1), then the price paid would be po per 
unit for all the allocations, and if p 0 or the shares a? are not too large, 
then budget constraints are irrelevant and a* is efficient. More generally, 
efficiency is attained if the budgets are not too far out of line with the 
valuations, i.e. there are no players with very high demand and very low 
budget. 

Remark b: ( Welfare and Efficiency) A more common measure of effi- 
ciency is the social welfare, which is the sum of all the players' utility J2i *ii 
including the seller % = 0. The natural definition, of the seller's utility is the 
value of the leftover capacity ao = Q - Hi^o °t P lus the revenue, i.e. 

uo ss. 4d(oo) + X) 

Then, r. a, = - a) + «o = + *o(oo) = Et ft- Thus, £i u* 

is equivalent to the efficiency measure used above, which is Another 
measure is the seller's revenue. Even though PSP is not, in general, revenue- 
maximizing. it tends to the revenue maximizing allocations and prices as 
demand increases [18]. 

Remark c: Proposition 3 provides a key to understanding the basic 
trade-off between engineering and economic efficiency. The smaller €, the 
closer we get to the value-optimal allocations. But in a dynamic game, 
where players iteratively adjust their bids to the opponent profile, a player 
will bid as long as he can gain at least e utility (since that is the cost of 
the bid), thus a smaller . € makes the iteration take longer to converge, i.e. 
entails more signaling. 

3*4 Convergence 

An issue of obvious concern is whether the game converges under dynamic 
play: it turns out that it does, when users behave rationally (see Proposition 
4 in Chapter 2 of [29]). Moreover, irrational or malicious behavior - like 
intentionally trying to prevent convergence by making unnecessary bids - 
can always be controlled by setting the bid fee e high enough to make such 
behavior prohibitively costly for the culprit. 
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Figure 4: Parabolic valuation with K = 0.5 and ? = 70 

Another issue is how the convergence time scales with the number of 
bidders. We now consider this experimentally using the software described 
in Appendix B. 

In all our simulations we let Q = 100. For each user, the valuation is 
strictly increasing and concave up to a maximum corresponding to a physical 
line capacity, and flat beyond that. Since, as shown by Proposition 3, only 
the second derivative of the valuation is needed to measure the efficiency 
of the PSP auction, a second order (parabolic) model is deemed sufficient. 
Thus we use valuations of the form: 

9i(z) = -Ki{z A ? t ) 2 /2 + A f(), 

where is the line rate, and m > 0 (see Figure 4). 

We generate our user population with independent random variables 
■f#(0)}r (corresponding to the maximum unit price the user would pay) 
uniformly distributed on [10,20], and = fl<(0)/? if and f, uniformly dis- 
tributed on [50, 100]. All players have a budget b { = 100. The bid fee is 
fixed at c = 5. Each user has a bidding agent which can submit at most one 
bid per second (see Algorithm 1 in Appendix B). 

With this set-up, the results are shown in Figures 5-6. Simulations were 
run for 11 population sizes ranging from 2 to 96 players. Each point is 
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Figure 5: Mean (+/- std. dev.) number of bids - solid line. The dashed 
line is I+P/IO. 

simulated 10 times with new random valuations for all players. The overall 
mean is 11.9 bids per player. Prom Figure 5, the number of bids seems to 
grow as the square of the number of players. 

The actual time to converge, shown in Figure 6, grows more slowly, since 
the computation of bids is done jn parallel by all the players. In fact, for 
small numbers of players, the time decreases. This can best be explained as 
follows. Suppose there are only two players, with similar valuations. They 
will both start by asking for their maximum quantity, at their marginal 
valuation (which at their maximum quantity is near zero). Then as each 
sees the other's bid, each will reduce the quantity and increase the price a 
little bit. And they go on taking turns, gradually raising the market price 
until they reach an equilibrium. However if there are 10 players, in between 
two bids by the same player, the 9 others will already have bid up the 
price, so he will jump to higher price than if there was only one opponent. 
Thus the equilibrium market price will be reached more quickly. For large 
populations, this effect becomes small compared to the sheer volume of bids, 
and the convergence time starts to grow. 

The trade-off between signaling and economic efficiency discussed in light 
of Proposition 3 is illustrated' by Figures 7-8. Increasing the bid fee speeds 
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Figure 6: Mean (+/- std. dev.) convergence time in seconds (for a 1 second 
bid interval). 

up convergence, at a cost of lost efficiency. A resource manager should select 
a bid fee which optimally balances the two for the particular context. 

Figure 8 also illustrates the validity of the lower bound given by Propo- 
sition 3. 

4 Conclusion 

Auctions are one of oldest surviving classes of economic insti- 
tutions [...] As impressive as the historical longevity is the re- 
markable range of situations in which they are currently used. 
[22] 

We proposed the progressive second price auction, a new auction which 
generalizes key properties of traditional single non-divisible object auctions 
to the case where an arbitrarily divisible resource is to be shared. We have 
shown that our auction rule, assuming an elastic-demand model of user pref- 
erences, constitutes a stable and efficient allocation and pricing mechanism 
•n a network context. Even though we are motivated by problems of band- 
width and buffer space reservation in a communication network, the auction 
was formulated in a manner which is generic enough for use in a wide range 
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of situations. In the sequel to this work, we show that the key results - 
namely incentive compatibility, equilibrium, and efficiency - generate to a 
setting where multiple networked resources are auctioned, with users bid- 
ding on arbitrary but fixed routes and topologies [29, 31]. In related work, 
we consider the case of stochastically arriving players bidding for advance 
reservations (i.e. resources for a given period of time) [29, 30]. 

An interesting direction of future work is learning strategies, and evolu- 
tionary behavior which can emerge from repeated inter-action between the 
same players. 
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A Information-theoretic basis for the valuation 

In general, valuations are simply assumed to be given as external factors. 
Indeed, the fundamental assumption in any market theory is that buyers 
know what the goods are worth to them. The "elastic demand 7 * or u di- 
minishing returns" nature of Assumption 1 is fully justified from a purely 
economic standpoint for virtually all resources in everyday life. 

In the case of variable bandwidth, we can go even further by better quan- 
tifying what the goods are. For a user sending video, say, how much value is 
lost when the channel capacity goes from 1.5 to 1.2 Mbps? Ultimately, the 
value lies not in the amount of raw bandwidth but in the information that is 
successfully sent. Our goal in this section is to give a brief description of how 
Information Theory can be used for a bottom-up construction of bandwidth 
valuations - based on the fundamental thing the user cares about which is 
communication of information - and that such valuations will generally be 
of the type in Assumption 1. ' 

Any information source has a function !?(.)» such that when compressed 
to a rate R, the signal has a distortion of at least D(R) [1]. The distor- 
tion is the least possible expected "distance" between the original and com- 
pressed signals, where the minimization is over all possible coding/decoding 
schemes. In this context, we make the distance measure the monetary cost 
of the error. This cost can be chosen, for example, to be proportional to 
some common measures like the mean squared error, the Hamming distance 
(probability of error), the maximum error, etc., or heuristic measures based 
on experiments with human perception. Given that modern source-coding 
techniques can, given a distortion measure, achieve distortions close to the 
theoretical lower bound[5], it is not unreasonable to use the rate-distortion 
curve as an indication of the value of the bandwidth share. 

Let Ui(0 he the distortion-rate function of {JCi(t)}, a stochastic process 
modeling the source of information associated with user t. X{ is encoded as 
Y{ which has a rate of R bits per second. 

Shannon's channel coding theorem [32] states that Y{ can be received 
without errors if and only if the channel has a capacity C > R. In our 
auction context, user i has capacity (bandwidth allocation) C = a,-, and 
thus has to suffer a distortion of at least 2?,'(a,-). The value of the bandwidth 
is then 

ftto) « Afe), (18) 
where 0; is the value of the full information. 
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The relevant properties of the distortion-rate functions are: 

♦ when the rate is greater than the entropy of the source, the distortion 
is zero, and 

• for many common source models and cost functions, the distortion- 
rate function is convex, and has a continuous derivative. 

It is easy to see that, with these properties. (18) satisfies Assumption 1. 

Example 1: Let {X} be a Bernoulli source, taking two values with 
probabilities p and 1 Without loss of generality, let p € [0, 1/2]. In this 
case, since the source is ia.d, one can define the distortion on a per symbol 
basis. Using a Hamming cost function 

d(jr,y) = i { Aw}> 

i.e. assuming it costs one unit of money, every time one bit is wrong, we have 
the distortion D = Ed(X,Y) = P{X ? Y). the rate- distortion function is 

where H(x) = -xlog(x)-(l-a;)log(l-x h and the distortion-rate fimction 
is the inverse function. It can be easily seen that D{R) is strictly convex and 
decreasing for 0 < R < H{p), and V(R) = 0 for R > H{p). The continuity 
of D' on 0 < R < H(p) and R > E(p) is obvious. At the critical point 
(R = H (p), D - 0), we have 

lim D'(R) = lim l/R\D) 

= lim l/log(2?/l- D) 

= 0 

= lim D'{R). 
R\H(P) 

Thus continuity of D' holds throughout, and Assumption 1 is valid for the 
valuation of the form (18) for this source - see Figure 9. 

Example 2: Let {X} be a Gaussian source with Markovian time- 
dependency, i.e a covariance matrix $ = ( a7r ^) iJ » r € [0, 1). Suppose we 
jse the squared error cost, i.e. it costs one unit of money for one unit of en- 
ergy in the error signal. Then, we have for low distortions D < ( 1 -r)/(l+r), 
fl(D)= |log^ ; or 

D(R) = (l-r 2 <r 2 2- 2 * 
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Figure 9: Distortion-rate based valuation for a Bernoulli p = 1/2 source 

and Assumption 1 clearly holds for (18). In the i.i.d. case (r = 0), the 
formula holds for all R. 

As the source models get more complex, it rapidly becomes impossible 
to give closed-form expressions for either R{D) or D(R). Often parametric 
forms are available, and the functions can be evaluated numerically. Fortu- 
nately, the convexity property extends to a wide class of models, including 
for example auto-regressive sources, even when the generating sequence is 
non-Gaussian (see (1] for a full treatment of R(JD)> including the above 
cases). 

It can happen, e.g. for some video source models, that the R-D curve, 
which gives the best (RJD) pairs achievable by any coder/decoder, is not 
convex. But. for tractability, practical codecs are usually optimized on a 
convex hull of the space of possible (R,D) pairs[26, 27]. Thus, even when 
the theoretical D(R) curve is not convex, the actual distortion achieved in 
real-life systems almost always varies in a convex manner with the available 
bandwidth. 
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B Simulation software and bidding algorithm 

A prototype software agent based implementation of the auction game, 
called TREX, has been developed and extensively used since December 1995. 
Much of the intuition behind the mechanism design and the analysis in this 
work came from experiments done on this inter-active distributed auction 
game on the World Wide Web, using the Java programming language. The 
game can be played in real-time by any number of players from anywhere 
on the Internet [28] . 

Each user plays in the dynamic auction game using the following: 

Algorithm 1 1 Let s { = 0, and = 0. Start an independent 
thread which receives updates of 

2 Compute the truthful €-best-reply of Proposition 1, U € n 
.9/(5-0- 

3 Ifui{U\ 5-<) > Ut-(*f;£-i') + then send the bid s { = U. 

4 Sleep for 1 second. 

5 Go to 2. 

No assumption is made on the order of the turns. Players join the game 
at different times, and depending on the execution context of the client 
program, the sleep time of 1 second is more or less approximate. This, along 
with communication delays which make the times at which bids arrive at 
the server and updates at the clients essentially random times, make the 
distributed game completely asynchronous. 

Algorithm 1 can be described as selfish and short-sighted. Selfish because 
it will submit a new bid if and only if it can improve it's own utility (by more 
than the fee for the bid). Thus, the game can only converge 7 to an e-Nash 
equilibrium. And short-sighted because it does not take the extensive form 
of the game into account, i.e. does not iise strategies which may result in a 
temporary loss but a better utility in the long run. 
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Chapter 4 

Spot and Derivative Markets in Admission 
Control 

Xet game now risk, G. [81] 

In Chapters 2 and 3, we proposed a pure market approach for bandwidth pricing, 
where allocated capacities may vary during the lifetime of a flow, as players compete 
for resources through an auction game. 

In this chapter, we propose a mechanism for circuit switched calls, wherein calls 
are admitted or rejected at (or soon after) their arrival time, and if admitted, get 
a fixed allocation of capacity, and have the option of securing the resource at a 
guaranteed maximum price for a guaranteed minimum duration. Thus the charge 
has two components: 

• a market-based usage charge, where the user continuously pays the "instan- 
taneous" market price (determined by second price auctions among recent 
arrivals); if the market price exceeds a user"* bid price, that user is dropped 
unless 

• upon arrival, the user pays a reservation fee i buys an option contract), which 
gives him llir right to buy the capacity at any time in the future up to a 
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specified duration at his bid price. The user pays the market price as long as 
it is below his bid price. If at some point during the call, the market price 
exceeds the bid price, then the user automatically exercises the option, i.e., 
remains connected while paying no more than the bid price. 

In other words, we introduce a "derivative" instrument to reduce the uncertainty 
inherent in the "spot" market mechanism. Naturally, this contract (or reservation) 
must itself be sold for a -fair" price. In the context of financial markets, the fair price 
is calculated with the Black-Scholes approach [8, 65), which is based on the idea that 
the option must be priced such that a perfectly hedged (i.e., riskless) combination 
of the derivative and the underlying equity will provide (locally in time) the same 
expected return as a risk free security [36] (otherwise the derivative would present 
an arbitrage opportunity, i.e., unfair advantage, which would be exploited until its 
price rises). 

In our context, the contract differs in that, rather than the right to buy once at 
a given strike price, it gives the right to buy repeatedly over a given duration, i.e., 
it is a series of "call options". Here the fairness* we seek is that the reservation 
be priced in a way that reflects the probability that the system will become busier 
during the lifetime of :he reservation, in order to avoid individually rational but 
socially sub-optimal behaviour. Specifically, we seek to avoid customers connecting 
at low periods and making a reservation limiting their usage price to an excessively 
low price for a very long time, to avoid rejoining when the prices are higher 1 . 

A further difference from usual options is that, rather than the standard geo- 
metric Brownian.motio- model of e.g., a stock price, we have an underlying spot 
market process that is derived directly from our queueing system, via a heavy-traffic 

1 A simplified form of this arbitrage is commonly observed with flatmate priced dial-up Internet 
access. wImt#- some users remain logged on for very long periods of time even if they are not using 
the network, in order to av:;d the chance of getting busy signals when they need to be on. 



51 

SUBSTITUTE SHEET (RULE 26) 



WO 01/88811 



PCT/US01/15424 



arrivals Poisson — A. 
bid price — F — 



B lines 



batch ouput 
every t-exp/fci 
3 ofsizem-Pm 
3 1 <~ m «r= R — 

D 
D 



times-cxp/r 



C-oMsbnl bidden 



m-(C-o) owes bidden 



Clines 



O 
O 
O 
O 
O 

o 
o 
o 

o 
o 
o 
o 



Figure 4-1: Queueing Model 

approximation which leads to a diffusion process. 

From the point of view of .the user, this pricing mechanism is simply an initial 
reservation fee plus a per-minute (for example) usage charge. The novelty in the 
networking context is that both are market prices. 

In Section 4.1., we derive a heavy-traffic diffusion approximation model for the 
queueing system. Using that, in Section 4.2., we derive the corresponding model for 
the spot market price. Then in Section 4.3., we introduce our formulation of the 
reservation as a derivative financial instrument. Finally, in Section 4.4. we present 
some simulation results using a real trace of traffic at a dial-up Internet access 
modem pool. For the sake of readability, detailed calculations are relegated to 
Sections 4.5. and 4.6.. Although not strictly necessary, a layman's grasp of financial 
markets is helpful in reading this chapter. 



4.1. Queueing Model 
4.1.1 Preliminaries 

Customers arrive in a Poisson stream of rate A, with i.i.d. bid prices distributed 
according to a distribution F (density /). Call durations are exponentially dis- 
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STAGE I 



STAGE il 




Figure 4-2: Pricing Mechanism 

tributed with mean 1/r. The queueing system is shown in Figure 4-1. It consists of 
two stages, with buffers of size B and C res^-ectively. 

The first stage consists of a second price auction, where the winners enter the 
second stage, and the losers leave the system. Specifically, the first stage has an 
exponentially distributed service time r wi:i mean 1///- At a service completion 
instant, let m be the number of customers : — the first stage and n the number in 
the second stage, at that time. The m first s-j*ge customers are ranked according to 
their bid prices. The (C-n) highest bids are accepted into the second stage, and the 
remaining m - ( C - n ) are dropped. The bid z rice of the highest dropped bid defines 
the new spot market price, which is valid unvJ the next batch. If no customers are 
dropped, i.e., m < C - n, then the spot market price is zero. Figure 4-2 shows the 
functioning of the mechanism. 

The second stngo is a C server queue r-r.h no waiting room, where each cus- 
tomer has an exponential service time (call duration) of mean 1/r. The arrivals 
are according to *i hatch Poisson process, v--.;j rate /i, and batch sizes distributed 
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according to ^m}Lo- Let p = A//*. For 0 < m < 5, 




P{m arrivals in r} 



and 



oo 




In Section 4.1.2, the steady-state distribution of the occupancy of the second 



the reservations, we need a (probabilistic) model of its transient behaviour in the 
future, given :he current price. This is done in Section 4.1.3. 

4.1.2 Product-form solution 

Let the state of the second queue (admitted calls) be X = (X u . . . , Xk), where 
X k = (X k .J. • . . Xjc,M k )i X k ,i is the bid price of the i-th player in the fc-th batch, 
Mk is number of calls remaining from the Ar-th oldest batch, and K is the number 
of batches present. Let N = Y.1L1 M ^ the total number of active calls, the letters 
x,mjt, and r. -.vill denote generic values of X,Mk and iV, respectively. 

X(t) is a rontinuous time Markov process. The transitions are of two types. On 
one hand we have departures of individual customers: 



queue cai* be shown to be of product form. However, for the purpose of pricing 



.r = (x 1 ,...,xjb,...,arA') -> x' = (xi 



ei(xfc), r/v)- 



(4.1) 
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occuring with transition probability 

7(x,x') = r, 

where a{x k ) = (x w , . . . , x w+ i, . . . , x*, m J. On the other hand, we have batch 

arrivals: 

occuring with probability intensity 

7(x,x') = Mn K+ i II 
t=i 



if 0 < m/c+i < C — n, and 



7(x,x') = M 5" /? m F( min xjc+i,0 m ~ m * +1 r, : 77 II 

if rriK+i = C — n. 

Proposition 11 7%e equilibrium probability of being in state x = (xi, . . . , x#) is 



1+7) S^:S /(IW) ' (4 - 3) 



where the normalization constant is given by 



{r:0<».<C} 

C n BAnBA(»-m,) »A(„-£^m ( ) (_£_)" (ft)' 
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Proof: To prove the theorem, we first analyze and then conjecture the transition rates 
7* of the reversed process, X (-0* and then verify that the following balance conditions: 
Vx,x', 

7r(x) 7 (s, = *)i (4.4) 

and Vx, 

X>(s,*0 = £7*(s,*'), (4-5) 

^ x' 

hold. Indeed, if (4.4) and (4.5) are satisfied, then by Theorem 1.13 of [46], it is the 
equilibrium probability of the Markov process X(t). 

Conjecture: X(-t) = x has the following properties: 

• new batches are formed by customers arriving according to a Poisson process of rate 

• arrivals into existing batch k are according to a Poisson process of rate A*(x). 
Arriving customers have a bid price i.i.d. with distribution -F; 

• upon arrival into a batch- if the system is not full (n < C), the customer is placed 
. with equal probability in any position within the batch (from 1 to m* + 1). If the 

system is full, the new customer departs immediately; . 

• batch k has an exponentially distributed service time, with mean l//i*(x). 
The rates for the reverse process are the following. First, 

A 0 = /i(l- AO- 
Second, if n < C or k < A\ 
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and 

] M*)P * 1 ^ m k < B 
( 0 i£m h -B 

Third, when n = C, and k = A', i.e., for the last batch when the system is full, 
and 

A*(x) = rm K - MiK*)- t 4 - 6 ) 

Note that in this last case all the arrivals immediately depart. 

We begin by verifying (4.4). For the transitions of type (4.1). i.e., corresponding 
to departures in the forward process, consider a departure from the Ar-th batch. When 
m* > 1. (4.3) yields 

n(x u x k )t s= tc(x 1 , . . . , e,(x*), . . . 1 xk) j-^ ^ f( x k>i)r. 

Since m* - 1 < 5 - 1, we have /3 mjt -i = P mfc ~7(l + p) m *,and therefore Aifc(x') = r- m* - 
!)/(! _ p). On the right hand side the system is not full (n* = n - 1 < C - 1). hence 
A*(x') = /i*(x')/> = rfer(mib - 1). Thus we have 

Now, since arrivals in the reverse process take any position in their batch with equal proba- 
bility, Ak(x')/mk is precisely the arrival rate into position i in batch k. i.e., Xki^/mk/i^k^i) 
7"(x , .r ). as desired. If m* = 1. (4.3) yields 

« - • • » •••jiA* )** = *(*!, • - - , xjt+i, *K ) T+~p ^ fi x k.i)r 

= 7T ( X! , . . . , X*_ 1 , 3 jfe+1 , . . . 1 * K* ) Ao/( Xj(. ? i ), 
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as desired. 

For the transitions of type (4.2), if 1 < < C - T*k=i m *> 



as desired. If mjc+i = C — ]C*=i m *' 



= ir(*')wr+i(*')> 



B m! mK+1 

E AnF( x< ^ mjw8(m 1 roiw)l g /<■*+«) 



= ic(x')n K+ i(x'), 



as desired. 

.We now verify (4.5). The left-hand side is the total rate at which the forward process 
X(t) leaves state x, which is M (l - £o) +«r. The right-hand side is the total rate of leaving 
state i in the reverse process. Consider first a state where the system is not full (n < C). 
The total rate is then (letting k range from 1 to K) 



Ao+E **(*)+/*(*) = mi- AO + (i+/>) E ^(*) + E 

= f*a-A)+(i+p) E (i +P )m fc+ i \—) 

= /i(l-<3o) + £ ^m * = ^ 1 ~ /3o) + rn, 

it 



9 ) 
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as desired. Now when n = C, the rate is 



Jb=i 

= /z(l - 0o) + (n - m*)r + rmjc - «cOO + ^k(*)> 



where the second term is derived as in the n < C case above, and the third term comes 
from the definition (4.6). Then the right hand side is - A>) + nr, as desired. □ 



4.!L3 Diffusion approximation 

We develop a tractable approximation for the queue occupancy process N u by scal- 
ing the system following the well-known heavy traffic approach [37, 9, 97, 98, 18], 
applied here to a system with batch arrivals. 

Let a = m = EM = ESUoinAn, and v* £ E(M 2 ) = E^ 0 m 2 /? m . Consider 
a sequence of scaled systems, indexed by / = 1, 2, . . with capacity C<'> = la+^Vla, 
where 7 is arbitrary, and batch arrival rate ^ = tyt. Let 

the translated and scaled version of the queue occupancy of the /-th system. The 
drift and diffusion coefficient respectively are 



-rz if z < 7 



(4-8) 

-r(7 + V/or) if z > 7, 
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and 

(z t +*-z t ) |z p } = z 



■{ 



r(l + u 2 /™ + z/V^) if 2 < 7 (4 
r(l + if * > 7- 



The basic idea is that as / - co, Z t , (where =!► denotes weak convergence, 

i.e., the probability distributions converge), where Z t is the diffusion process which 
solves the stochastic differential equation 

dZt = a(Z t ) dt + a(Z t ) dW u (4.10) 

where W t is a Brownian motion. 

Note that, for 7 < oo, the offered load a^/C^ = (1 + t/^)" 1 / 1 » which 
makes it a "heavy traffic" approximation. 

The coefficients a(.) and <7 2 (.) are obtained by letting J - oo in (4.8) and (4.9) 
respectively. In the original approach of [37], Z t ( '> Z t holds if <z(.) and <r 2 (.) are 
continuous, which is clearly not the case here, since both have jumps at z = 7. 
. However, based on a theorem of Borovkov [9], Whitt [98] proposes a conditioning 
heuristic to derive approximate steady-state blocking probabilities for the G/GI/s/Q 
(i.e., s servers, no waiting room) queue from the diffusion approximation for the 
corresponding G/G//00 system. Our approach here will be to apply the heuristic 
to the diffusion process itself 2 , conditioning on Z t <i- 



2[98] p. 694 states that, for exponential service times, the heuristic is also applicable to the 
diffusion process itself. 
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4.2. Diffusion Models of the Market Prices 
4.2.1 Spot Price 

Suppose that at time i, the queue occupancy is N t = n, and a batch of size m arrives. 
The admission decision is made, and a new spot price results. In this section, we 
want to derive a model which will tell us in some sense the future evolution of this 
price. 

Consider the mechanism by which the price arises. Of the m new customers, 
those with the C - n highest bid prices will be admitted, and the spot price will 
be the (C - n + l)-th highest bid price. Recall that the bid prices are i.i.d., with 
distribution F, which we assume to be smooth. The probability that the price is x 
is the probability that in m draws, one will equal x, and of the remaining m - 1, 
C - n will be greater than x, and (m-l)-(C- n) will be less than or equal to x, 
which is 

= "/<*> (5 - , W( L m -" 1 } ) : ( c - ' f 11 - f ( ' )]C ~ • 

= (C - gpgi - c + .), ^*>r~~~' p - ^r" /(*>■ 

The distribution is 

F n , m (x) ^ f'uMdy 

JQ 

(C-n)\(m - 1 -C + n)! 



_ ^ UZl u m-C+n-l ( a _ u f-n ^ 

~ Jo (C -n)\(m-l-C + n)\ 



where the last equality comes from substituting the previous expression, and making 
the change of variables u = F(v/). <lu = f(y) dy. 

Let us consider a "first-order" approximation, the expected spot price given the 
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occupancy n and the size of the batch arrival m: 



= / / g(u^n^Tn)dudx 

Jo JF(x) 

= f l F~ l {u)g(u, n, m) du, 
Jo 



where 



^•^ m )=(C-n)!(m-l-C + n)! U (1 " } " ^ 

4.2-2 Diffusion model of the spot price 

For the purpose of pricing a reservation beginning at time t, we would like to char- 
acterize the future evolution of the market price, in terms of what is known at or 
just before i. namely the occupancy N t . Thus, it is. natural to consider the process 

B 
znssO 

Remark: We prefer the above expression to one that uses the information in 
the batch the: arrives at time since computing the option prices based on the new 
arrivals would add to the delay in the admission decision. By not waiting for the 
new informa-ion, we can use the time in between batches for computation of option 
prices. Of course the spot price results from the auction among the new bids, but 
that is a very simple computation. 

Now. we approximate the price by a sequence of functions of the centered and 
scaled process 
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Then, Ito's rule (see, e.g., [45]) along with (4.10) yields the stochastic differential 
equation for P: 



Since 



f^ a (0 + dt + d -^-^dW t . (4.12) 

dz ^2 dz 1 J <fe 



P (t) (z) = E A* T f" 1 («)p (0 («.^* + «'»m) **, 

m=0 • / ° 

where g w is defined as g with C<'> instead of C, it follows that 

^(z) = f ^ T F-*{u)£-Mu,V3* + af,m) At, (4.13) 

and 

= E A. f ^Wn^. VS7* + a/, m) A.. (4.14) 
a* m=0 1/0 oz 

Now (4.12), along with (4.8), (4.9), (4.13) and (4.14), constitutes a diffusion 
model for the spot market price P t , as / -* oo. From the results derived in Sec- 
tions 4.5 : and 4.6., we can approximate (4.12) by dP t = 0 when z < 7, and 



+y/rt(±P t -K[ m) )dW i ], (4.15) 



when : = 7. V is a constant related to the accuracy to which we want to evaluate 
g, and can typically can be taken to be 1 (see Section 4.6.). Vm, the constants 
A'}" 0 . /\'j m) , which depend only on the bid price distribution F . are denned (and 
can Ik- computed) by (4.23) and (4.25) respectively. 

Remark: z = 7 corresponds to .V, = C, i.e., the system is full. Thus we have 
a model where the spot price remains constant when the system is below capacity, 
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and varies when the system is full. This corresponds to what one would intuitively 
expect from the auction mechanism (with the constant price 0 when the system is 
not full). Note that we do not attempt to model the transitions between the two 
regimes, so this model is meaningful only during periods when the system is almost 
always full. 

4.3. Computing Reservation Fees: the Derivative Market 

Taking the current market price P t as initial condition, the solution of (4.15) provides 
a stochastic model of the future prices {P T } T > t (this solution is unique in distribution 
- see [45]). From that, we now determine the price of a reservation, arriving at time 
with a holding time T, and a bid price p. The value, or "fair price", at time t of 
an option to buy a security for a "strike price" p at a specific future date 3 r > t is 
(see [45, 36]) 

C T ^E(P T -p) + . 

In our context, this concept must be extended in the following straightforward 
manner: we define the reservation as a hold option, a new kind of derivative instru- 
ment which is an option to buy repeatedly at every time instant from t to i + T. 
Thus the reservation fee should be 

«t,T,p) (4.16) 

If A") = A 2 = 0, Cr is given explicitly by the Black-Scholes formula [36],. For the 

3 This is called a European option, as opposed to an American option, which is the right to buy 
at any time between t and r. 

Note thai we take the risk-free interest rate to be 0. 
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more general form (4.15), which we re-write as 

dP r = (AP r + K 3 ) dt + (DPr + K 4 ) dW u (4-17) 
for r > t, with known initial condition P t , the solution is (see [45], Section 5.6) 

where 

5 T i exp [(A - D*/2) (r - *) + jf i? <W.] • 

lowing the set of distributions 7> t = {F(P T < x|P t ), Vx : r > t}, we can evalu- 
ate Ct, for t > i and the reservation fee <f> follows. 

4.4. Simulations 

In this section, we simulate the pricing mechanisms presented in the preceding 
sections, applied to dial-up Internet access. 

Our data set consists of all the connections that were made to the Columbia 
University modem pool between April 4th and May 10th 1998. At that time, the 
system, with a total of 316 lines, was very heavily loaded (except for the period 
roughly from 2AM to 7AM every day, essentially all lines are always busy). There 
was a total 669,994 calls, from 9451 different users, with an average inter-arrival time 
of 5.59 seconds, and an average call duration of 1299.3 seconds (see Figure 4-3). 

The unsatisfied demand, i.e., the calls which encounter busy signals, are not 
logged in our data trace, since the blocking occurs in the public telephone network. 
Our data trace only includes admitted calls. Thus feeding the trace into a system of 
316 lines would result in no blocking, which would not be interesting since blocked 
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Inier-emvn) times — mean = 5.699 sec 




10" 



call duration — mean ■ 21.65 min 




250 



15 

hours 



Figure 4-3: Modem pool call statistics (solid) and equivalent - same mean - expo- 
nential distributions (dashed) 

calls are what gives rise to the market price. By scaling down the capacity of system 
in our simulation, we can recreate the congestion. The traffic was fed into a system 
running with 202 lines, of which IS were allocated to the first stage, and 184 to 
the second stage. The call arrival times and bid durations are from the real traffic 
trace (however the simulation was run at 10 x real-time speed), while bid prices are 
randomly generated from F. the uniform distribution on [0, 1). 

Rather than explicitly evaluating the set of distributions functions V t resulting 
from (4.18), our technique consists of Monte-Carlo siumulation. Specifically, at each 
time when a reservation has to be computed, beginning at the known current price 
P t , we generate sample paths by the difference equation corresponding to (4.15), 
which results in a set of histograms (one for each value of t) corresponding to V t . 
Figure 4-4 shows a representative example. 

Since the arrival process is clearly time- varying, the implemented reservation 
mechanism makes real-time estimates «,r A. and r, to evaluate the constants A, A' 3 , D 
and AV using the formulas in Section 1.0.. 
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12 




0.2 r 



Q L j 4 6 8 10 12 

Figure 4-4: Histograms of P T , for Po = 0 and r = 0, 1, 2 minutes; the bins represent 
price levels [0, 0.1), [0.1, 0.2), . . . , [0.9, 1). 





Figure 4-5: Simulation trace 
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Figure 4-5 shows a snapshot of the simulation trace. The first plot shows the 
number of users arriving in each batch interval, where J = 1 minute. The second 
plot shows the spot price process resulting from the batch auctions. As expected, 
prices are zero when the system is lightly loaded during the night. The average price 
including days and nights is 0.2055. 

The third and fourth plots show the number of users in the second stage, in the 
reserved and unreserved states. In this simulation, all users accept the reservation 
offer, thus they only remain in the unreserved state for a brief period, from the time 
the offer is sent by the auctioneer (which is a server program), until the acceptance 
returns from the user (which is a client program on a different host). 

In the fifth plot, "value" is the sum of the bid prices in the second stage (re- 
served+unreserved): this represents the social welfare, or the total value that the 
users are getting from the system. Without pricing, i.e., if all the users were sim- 
ply admitted on a FCFS basis, the average value during peak periods would be 
184 x 0.5 = 92, since the average bid price is 0.5. Here the average value during the 
day is approximately 110, i.e., the pricing mechanism yields a 20% gain in efficiency. 
In other words, without pricing there are many times when a user who values access 
less is denying a user who values it more. The efficiency is shown more clearly in 
Figure 4-6. normalized so that the horizontal line corresponds to the FCFS system. 
Note that there is an efficiency gain of up to 10% even during off-peak hours, because 
some of the off-peak users arrived during peak hours. 

The sixth plot of Figure 4-5 shows the cumulative revenue from usage charges 
and reservation fees. Since the batch interval is one minute, and usage charges are 
assessed on a minutely basis, we take the price of 1 as the "full price" reference level, 
i.e., ono unit of revenue is when 1 line is charged a price of 1 for I minute. The 
unit is (filled an MOU (minute-of-use). As can be seen on the plot, the revenue for 
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J . . 









Figure 4-6: Efficiency: 1-0 = first r come first-served system 

one day is approximately 125,000 MOUs. The 184 lines multiplied by 1440 minutes 
in a day yield about 264,960 potential minutes of use. Thus, on average, a modem 
line generates 0.47 MOUs of revenue per minute, including usage and reservation 
charges. 

Figures 4-7 and 4-8 show how the reservation fee offsets any attempt to "arbi- 
trage". Durations range from 0 to 24 hours, and the reservation fees range from 0 
to 640 MOUs, with the highest ratio being 1 MOU of reservation fee per minute of 
reservation duration, as can be seen by the straight line upper-bounding the scatter 
plot in Figure 4-7. The highest ratio (most expensive reservation) would be for a 
user who gets into the system with a bid price near zero and requests a long reser- 
vation at a time when the arrival rate, is high. That is illustrated by Figure 4-8. 
A user asking for a long duration reservation at a low bid price must pay a higher 
fee which compensates for the expected future rise in prices. The vertical variations 
are due to the market price at the moment the reservation offer is made, where the 
higher end are reservations made when the market price is high. 
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Figure 4-7: The reservation fee is at most lx duration. 
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Figure 4-S: Reservations are proportionally more expensive for low bid prices. 
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4.5. Approximations related to g 

We first seek a convenient approximation of (4.11). Fix V > 0. 

4.5.1 Case mu(l — u) > V 2 

By the Central Limit Theorem, the binomial density tends to the normal density, 
i.e., for 0 < u < 1, we have 

m , t . 1 { -[k-(l-u)m) 2 \ 



k\(m 

Thus, 



^"'"' m ) = (C-n)!(m-l-C + n)! U 
m — C + n 



1 r -[C-n-(l-u)mj 2 \ 
^T^) eXP l 2«(l-«)m J 



« yf2irmu(l — u) 

Differentiation yields 

a J 1 , C-n-m(l-u) ] 

^ («, n, m)«^( U ,n,m) [ m _ c + n + u (l - «)m J 

and 

C - n — m(l - tp 1 2 



o 2 f T i C — n — m(j 

^(u,n,m) « 5 («,n,m)^ m _ (C _ n) + u(1 _ u) 

~ (m-C + n) ~ u(l -u)m} 
Now, recalling that C (,) = /a + 7%//<», we get 

g< l) (u,Val: + cv/,m) « x 
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1 | -[( 7 -z)v^-(l-^f j 
^2^(1 - u) { 2u(l-«)m J" 

When z 7^ 7, as / — » oo the exponential goes to zero faster than the other factors 
grow, thus §z(u,Vaiz + a/,m), 0(u, y/alz + a/,m) -» 0. Therefore, by (4.13) 
and (4.14), ^(z) = ^(z) - 0. 
But when z = 7, 

*"><«,^ + oi , m) ~ ? -^^e^{r(l^} (4 . 19) 

J-s(ix,v£7z + a/,m) ^fl^^^Kv^/^ + a^m) (4.20) 

~-g(u, Vaiz + a/, m) a/ ( ) r rr r — * 

Oz 1 ■ \\m uJ m? ti(l — u)m\ 

gW(u,VZiz + aI,m) (4.21) 
as / — > 00, where ^ indicates that it is an approximation valid for large m. 
4.5.2 Case mu(l - u) < V 2 

Here we consider the neighborhoods of u = 1 and u = 0, where the normal approx- 
imation is no longer valid. Indeed 

g(l,n y m) = 

and 

g(0,n,rn) = ml {C -m+i}(n), 
where ls(x) = 1 if x € S and = 0, otherwise. 
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Thus, for u near 1 T we write 



gW(u,y/aiz + al,m) « ml/ , i (z) -* ml { ^{z) 




where £ is the Dirac-5 function. 




For u near 0, we get the same approximation with 7 replaced by 7 — which 
becomes identical to the above as / — ► 00. 



4.6. Approximating the Drift and Diffusion Coefficients 

When z ^ 7, by the results of 4.5.1 and 4.5.2, the right-hand side of both (4.13) 
and (4.14) -> 0, as / -* 00. Therefore, by the finiteness of (4.8) and .(4.9), the 
right-hand side of (4.12) — ► 0, as / — ► 00. 

Thus we need only be concerned with the regime z = 7. Let 



which satisfies «v tTn (l — u Vf m)m = V 2 . V can be interpreted as the smallest variance 
at which the binomial distribution is sufficiently close to the normal distribution. 
Typically V can be taken to be about 1. 

Now we will approximate g and its partials in three regions: [0,txv, m ). ("Urn* 1 — 



In the first hiu\ last regions, as shown in Section 4.5.2, the coefficients tend 



HereVz, ©iJ-^W^O. 



m 



UV.m]> and (1 - Mf.m-1]- 
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to 0- Thus we need only focus on the middle region, where the appoximations of 
Section 4.5.1 apply. Also, if m < 4V 2 , u Vt m is not real, and in fact we axe always in 
the case of Section 4.5.2, where the coefficients tend to zero. Thus we have non-zero 
terms only when m > 4V 2 . 

Writing a<'> = -r(7 + y/la) = -rC/>/?a, and plugging (4.20) into (4.13), we get 

_l£p(') + rCjfir ('«) > (4.22) 
m 

as I — * oo, where 

A v , ro ti famuli - u) I 2 " J 

= ff' ^ { ^^ ) l£±Il ll e -^ dw (4.23) 



where the second line follows from a change of variables w = y(* ujm/te, 
and 



Similarly, using <r M2 = r(l + 7 /v7a) = rCf(loc), and plugging (4.21) into (4.14) 
leads to 



/1-UV.m , /— 2 1 1 \ 

?7j 1 f — (1 — u)m l ^ 

» ^/25rmt«(l- «) eNP I 2u J 
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- rC(-2Kl m) /B + I<l m) ) (4-24) 

as / — ► co, where 

- * >m + w* } \ m J \ toVVw 

Finally, 

d.Z Jvv,m tt/ 

Substituting all of the above into (4.12) yields (4.15). 
4.6.1 Example: F uniform 

Suppose F is the uniform distribution on [0, P]. Then F-'(^) = hence 

*t* = 

= 2P[*(u>i m) )-*(4 m) )]> 

where $ is the normal distribution of mean zero, and variance 1. Also, 
Af° = P jll! (^Vm + 1 - 1/m - l/^ 2 ) \Jl^ /2 dw 
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4.7. Optimal Seller Strategies 

In this section, by way of conclusion, we briefly outline approaches to the comple- 
mentary control problem, i.e., how the seller can "tune" the mechanism in order to 
optimize some system-wide objective. 

Taking the above mechanism as a given, consider the optimal strategy for the 
network manager, whose objective is to maximize revenue, using fi(P) € [0,oo] 
as the (feedback) control. As the spot market is governed by auctions, P = 0 
whenever the system is not full. The relevance of the problem is during periods 
when the system is heavily loaded (i.e., almost always full). Then, the objective is 
equivalent to minimizing 

J(s,y,p) = -£ Ptdt, 

over all admissible controls ft(t) = p[P]. r is the stopping time r = inf {i > s : P t = 
0}, and the initial state is P a = y. The dynamic programming equation is 

0 = ?K + mm( AW -y), 

OS P 

for y > 0, and W(s,0) = 0. The differential operator A is given by [20] 
A=\ [D(ti)y + KM] + [A(?)y + KtQt)] ^- 

If W is a solution of the above equation, then the optimal cost is J(s,y,fi m ) = 
W(s,y), and, in the classical approaches, the optimal' control is obtained by per- 
forming the minimization over fi. In this case, even when W can be solved for, 
or guessed, for each y finding the control n(y) is in general not tractable since it 
involves polynomials of degree B\. 

There are a number of promising approaches 10 this control problem. It is inter- 
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esting to investigate the applicability of approximation techniques by discretizing 
the original problem, and solving iz as a Markov-decision problem, in the manner 
of [53] as well as that of [81], and compare the resulting performance with the opti- 
mal open-loop control. One should also consider some heuristic solutions based on 
related but simpler solvable problems, and investigate the relationship of the opti- 
mal control with the notion of Brownian local time, as in [11]. Finally, the trade-offs 
between optimality and computational complexity should be investigated. 
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What is claimed is: 

1 . A method for real-time market-based resource allocation, comprising: 

receiving at least one bid in real-time for a resource; 
5 deciding which of the at least one bids has won the bidding; and 

controlling the resource so that it is committed to the winning bidder. 

2. The method of claim 1, wherein the resource is bandwidth. 
10 3. The method of claim 1, wherein the resource is buffer space. 

4. The method of claim 1, wherein the resource is processor time. 

5. The method of claim 1, wherein the resource is controlled in real-time so that the 
15 winning bidder has access to the resource he has won as soon as bidding closes. 

6. The method of claim 1, wherein the bid is received from a software agent. 

7. The method of claim 6, wherein the software agent bids in accordance with a 
20 strategy rule. 

8. The method of claim 7, wherein the strategy rule is a truthful best reply strategy. 

9. The method of claim 6, wherein the software agent bids in accordance with a 
25 valuation rule. 

10. The method of claim 9, wherein the valuation rule is determined in accordance 
with at least one measured network parameter. 

30 11. The method of claim 6, wherein the software agent bids in accordance with an 

allocation rule. 
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12. The method of claim 1, wherein deciding which bid has won the bidding is 
performed in accordance with an allocation rule. 

13. The method of claim 12, wherein deciding which bid has won the bidding is 

5 performed in accordance with a market allocation rule also used by a buyer software agent, 
the market allocation rule defining the rules of the resource market. 

14. The method of claim 12, wherein deciding which bid has won the bidding is 
performed in accordance with an English Auction market allocation rule. 

10 

15. The method of claim 12, wherein deciding which bid has won the bidding is 
performed in accordance with a continuous bid-ask trading market allocation rule. 

16. The method of claim 12, where deciding which bid has won the bidding is 
15 performed in accordance with a progressive second price auction allocation rule. 

17. The method of claim 12, where deciding which bid has won the bidding is 
performed in accordance with a hold option allocation rule. 

20 18. The method of claim 1 , further comprising: 

storing information concerning which bid has won, for accounting purposes. 

19. The method of claim 1, wherein there are several resources, and a respective 
resource agent performs the elements of claim A for each resource. 

25 

20. The method of claim 1, wherein there is one resource, and a single resource 
agent performs the elements of claim 1 for the resource. 

21. The method of claim 1, wherein the elements of claim 1 are performed by a 
30 resource agent executing on a separate computer than the placing the bids. 
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22. The method of claim 1, wherein the elements of claim 1 are performed by a 
resource agent executing on the same computer as at least one placing at least one of the 
bids. 

5 23. The method of claim 1, wherein the is controlled by a human being, who 

decides how to bid. 

24. The method of claim 1, further comprising: 

deciding, by a bidder agent, how to bid based on a valuation function of the 
10 bidder agent and a strategy algorithm of the . 

25. The method of claim 1, further comprising: receiving an offer to sell resources 
from a seller agent. 

15 26. The method of claim 1, further comprising: 

receiving player agents in a garage of a resource agent and receiving bids 
from player agents in the garage. 

27. The method of claim 1, wherein the bid is received from a player agent that also 
20 submits offers to sell resources* 

28. The method of claim 1, further comprising: 

before bidding, receiving a request from a buyer software agent for a location 
of resource agent. 

25 

29. The method of claim 1, further comprising: 

sending at least one bid in real-time for a resource; 
receiving a notification that the sent bid has won the bidding; 
selling the use of the winning resource to third parties through a separate 
30 resource agent; and 

notifying a resource management and control agent that the third parties will 
be using the resource. 
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30. A method, performed by a multiagent, comprising: 

sending at least one bid in real-time for a resource in accordance with a 
strategy rule and a valuation rule of the multiagent and in accordance with a system- 
wide allocation rule; 
5 receiving a notification that the sent bid has won the bidding; 

selling the use of the winning resource to third parties through a separate 
resource agent; and 

notifying a resource management and control agent that the third parties will 
be using the resource. 

10 



3 1 . A method, performed by a buyer agent, comprising: 

sending at least one bid in real-time for a resource in accordance with a 
strategy rule and a valuation rule of the buyer agent and in accordance with a system-wide 
1 5 allocation rule; 

receiving a notification that the sent bid has won the bidding; and 
making use of the resource in real time, immediately after winning the bid. 



32. The method of claim 32, wherein the system wide-allocation rule is a PSP 
20 allocation rule. 



33. A system for real-time market-based resource allocation, comprising: 
means for receiving at least one bid in real-time for a resource; 
means for deciding which of the at least one bids has won the bidding; and 
25 means for controlling the resource so that it is committed to the winning 

bidder. 



34. A system for real-time market-based resource bidding by a multiagent, 
comprising: 

30 means for sending at least one bid in real-time for a resource in accordance 

with a strategy rule and a valuation rule of the multiagent and in accordance with a 
system-wide allocation rule; 

means for receiving a notification that the sent bid has won the bidding; 
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means for selling the use of the winning resource to third parties through a 
separate resource agent; and 

means for notifying a resource management and control agent that the third 
parties will be using the resource. 

5 

35. A system for real-time market-based resource bidding by a buyer agent, 
comprising: 

means for sending at least one bid in real-time for a resource in accordance 
with a strategy rule and a valuation rule of the buyer agent and in accordance with a system- 
10 wide allocation rule; 

means for receiving a notification that the sent bid has won the bidding; and 
means for making use of the resource in real time, immediately after winning 

the bid. 

15 36. The system of claim 35, wherein the system wide-allocation rule is a PSP 

allocation rule. 

37. A system for real-time market-based resource allocation, comprising: 

a portion configured to receive at least one bid in real-time for a resource; 
20 a portion configured to decide which of the at least one bids has won the 

bidding; and 

a portion configured to control the resource so that it is committed to the 
winning bidder. 

25 38. A system for real-time market-based resource bidding by a multiagent, 

comprising: 

a portion configured to send at least one bid in real-time for a resource in 
accordance with a strategy rule and a valuation rule of the multiagent and in 
accordance with a system-wide allocation rule; 
30 a portion configured to receive a notification that the sent bid has won the 

bidding; 

a portion configured to sell the use of the winning resource to third parties 
through a separate resource agent; and 
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a portion configured to notify a resource management and control agent that 
the third parties will be using the resource. 

39. A system for real-time market-based resource bidding by a buyer agent, 
5 comprising: 

a portion configured to send at least one bid in real-time for a resource in 
accordance with a strategy rule and a valuation rule of the buyer agent and in accordance 
with a system-wide allocation rule; 

a portion configured to receive a notification that the sent bid has won the 

10 bidding; and 

a portion configured to make use of the resource in real time, immediately 
after winning the bid. 

40. The system of claim 39, wherein the system wide-allocation rule is a PSP 
15 allocation rule. 

41. A computer program product, including instructions for causing a data 
processing device to perform actions for real-time market-based resource allocation, 
comprising: 

20 receiving at least one bid in real-time for a resource; 

deciding which of the at least one bids has won the bidding; and 
controlling the resource so that it is committed to the winning bidder. 

42. A computer program product, including instructions for causing a multiagent of 
25 a data processing device to perform actions for real-time market-based resource bidding, 

comprising: 

sending at least one bid in real-time for a resource in accordance with a 
strategy rule and a valuation rule of the multiagent and in accordance with a system- 
wide allocation rule; 

30 receiving a notification that the sent bid has won the bidding; 

selling the use of the winning resource to third parties through a separate 
resource agent; and 
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notifying a resource management and control agent that the third parties will 
be using the resource. 

43. A computer program product, including instructions for causing a buyer agent 
of a data processing device to perform action for real-time market-based resource bidding, 
comprising: 

sending at least one bid in real-time for a resource in accordance with a 
strategy rule and a valuation rule of the buyer agent and in accordance with a system-wide 
allocation rule; 

receiving a notification that the sent bid has won the bidding; and 
making use of the resource in real time, immediately after winning the bid. 

44. The computer program product of claim 43, wherein the system wide-allocation 
rule is a PSP allocation rule. 
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Figure 4 
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1 00 units of resource: 
A bids for 50 at $3, 
B bids for 30 at $2 
C bids for 30 at $1 
D bids for 20 at $0.50 




Example Market Allocation Rule (PSP) 

Figure 5 
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Diffex Server 




Figure 9(b) 
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<?xmlversion="1.0"encoding="UTF-8" ?> 

- <AuctionPlayer context^httpi/ZHOSTNAMEiHTTP^PORT/bx/garage^ 
- <SingleFrameGUi> 

<TextPanel name="News" height= "50" visible="true" border= 0 false a /> 
<LoginPanel name="Login" heights 60° visible="true" border="true" /> 
<ResourceAgentPanel name= n ResourceAgent a height= D 80 n visible="true" 
border= n true" /> 

<UploadAgentPanel name= 0 Garage D height= n 80° visibie="true" 
border=°true"/> 

<BidCanvasPanel name= n BidCanvas 0 height= n 180" visible="false" 
border="true" /> 

- <StrategyChoicePanel name="Strategies" height="160" visible="true" 

border="true"> 

<StrategyPanel name="Manual" strategy=°ManualStrategy" /> 
<StartegyPanelNotEdHable name="Auto" strategy="TruthfulStrategy" l> 
</StrategyChoicePanel> 

- <ValuationChoicePanel name="Valuations" height="240" visible="false" 

border="true"> 

<WebValuationPanel name="Web Valuation" valuation="WebValuatlon" l> 
<ValuationPanel name="Elastic Demand" valuation "RobertsValuation" /> 
<ValuatlonPanel name="lnelastic Demand" valuation="UneaiValuation" l> 
<BudgetValuationPanel name="Budget Valuation" label=" " 
valuation="BudgetValuation" /> 
<A/aluationChoicePanel> 

<PlayerlnfoPanel name="Allocatlon" height="120" visible="true" 
border="true" /> 

<BudgetPanel name="Budget" height="80" visible="false" border="true" /> 
<DisplayPanel name="Units" height="80" visible^false" border="true" l> 
<IPAddressPanel name="IP" height="110" visible="false" border="true" /> 
<ConnectionPanel name="Connection" height="140" visible="false" 
border="true" /> 

<BidTablePanel hame="Bid Table" height="400" visible="false" 
border="true"/> 

<Bid6raphPanel name="Bld Graph" height="400" visible="false" 
border="true B /> 

<AllocationGraphPanel name="AI»ocatlon Graph" height="400" visibie="false" 
bordef="true"/> 
</SingleFrameGUI> 

<Playerldentity name="USERNAME° passwd="PASSWD" ipaddress="IP_ADDRESS" 
netmask="NETMASK° l> 

- <LinearValuation label="lnelastic Demand"> 

<Parameter name="qmax" value="45000.0" label="Kbps" /> 
<Parameter name="vmax" value="44928.0" label="$/month° /> 
</LinearVa{uation> 

- <RobertsValuation current="false" label= u Elastic Demand"> 

<Parameter name="qmax" value="45000.0" label="Kbps" !> 
<Parameter name="vmax" va!ue="4928.0° label="$/month" l> 
</RobertsValuation> 

- <BudgetValuation current="true" label="Budget Valuation^ 

Parameter name="qmax" value="1000.0" label="Kbps" /> 
Parameter name="budgef value="100.0" label="$/month° l> 
</BudgetValuation> 

Figure 11(a) 
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- <WebValuation iabel= n Web Valuation^ 
<param name= a delay° value= n 100.0° !> 
<param name= n hitspermonth n value= n 100000.0 B /> 
<param name= D fHesize a vaIue= ,, 1000.0 Il /> 
<param name= D centsperhif value= D 0.1° l> 
<param name= n randomize n value=°false n l> 
</WebValuation> 

<Parameter name= D budget n value= n 51840.6° label= B $/month a l> 
<ManualStrategy current= n false n label= a Manual 0 1> 
<TruthfulStrategy current= n true 0 label=°Auto n l> 
<resourceAgentURL nickname^RESOURCE^NAME 0 

current=^rue">http://H^^ 
<uploadURL nickname= n HOSTNAME 

garage n >http://HOSTNAME:HTTP_PORT/bx/garage</uploadURL> 
<param name= n playertnterval n value^OOO" f> 
<param name=°timeouf value= n 2000° /> 
<param name^timelaber 1 value^min" /> 
<param name= n currencylabel° value= n c° /> 
<param name^quantityfabeF va!ue=°Mbps n /> 
<param name= n debug n value="false n /> 
</AuctionPlayer> 



Figure 11(b) 



SUBSTITUTE SHEET (RULE 26) 



WO 01/88811 



PCT/US01/15424 



14/38 

r 

* File: Truthful.java 

* Remark; Strategy for player with diminishing returns 
* 

* $ld: Truthful.java,v 1 .16 07:43:19 cobe Exp $ 

* 

7 

package ihn.merkato; 
import org.w3c.dom*; 
import com.sun.xml.tree.XmlDocument ; 

r 

* The strategy that bids the truthful best reply as in Proposition 1 o 
f 

* the PSP paper. 

* It will only submit the bid if utility will be increased by at least 
*epsilon. 

*<p> 

*@author Nemo Semret 

*( 
7 

public class Truthful extends AuctionStrategy { 
Bid tmp = createBid( ) ; 

* Finds truthful best reply as in Proposition 1 of 

* the PSP paper. 

* Sets the bid at the player if utility will be increased by at leas 

t 

* epsilon. 

* <p> 

* If timelogging is enabled, this will write to the player's 

* log a line with current time, bid, allocation, and utility, 

* at each call. 

* @see #epsilon 

* @see ihn.merkato.AuctionPlayer#setBid 
7 

public boolean bid( ) { 

double lq=0, uq= getPlayer ( ) .getValuation ( ) .qmax ( ) , mq= (uq+Iq) / 2, 
dq= getPlayer ().dq{); 

rf(debug()) { 

getPlayer ( ) .log fq range = r+lq+ 
n , n +uq+ a ] dq="+dq+ 
n Q=°+getPlayer(). stuff ()); 
getPlayer ( ) .addnews ( a . " ) ; 

Example of Agent Strategy 

Figure 12(a) 
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Truthful.txt 

// see Proposition 1 
int i=0 ; 

double mp, dv; 

while ( uq-lq >dq && i<20 ) { 
i++; 

mq= (lq=uq)/2; 

r 

if (mq < getPlayer ( ) .stuff ( ) - 

(getBidder ( ) .getBidlist ( ) ) .demandAtPrice ( 

getPlayer ( ) .dval (mq, mq+dq) , 

getPlayer ( ) .getld ( ) ) ) 

7 

/ / the following is equivalent and more general 
dv-getPlayer ( ) .getValutaion ( ) .dval (mq, mq+dq) ; 
mp=getBidder ( ) .getBidList ( ) .marketPrice (getPlayer ( ) .stuff ( ) 

-mq, 

getPlayer (). getld () 

); 

if (debug ()) 

getPlayer (). log ( B i=°+i+" mq="+mq+" dv="+dv+" mp="+mp) ; 

if (dv>mp) 

lq=mq ; 
else 

uq=mq ; 



tmp.bidderid = getPlayer ( ) .getld ( ) ; 
tmp.price = Data.MAXPRICE; 
tmp.qty = Iq; 

if(debug()) 

getPlayer ( ) .log ( °°+ i +" steps, q range = f+lq 

+" l 0 +uq+"] currentbid="+ 
getBidder ( ) .anteBid ( ) + ' found °+tmp) ; 

if (util (tmp) <0) { 
uq= tmp.qty ; 
lq=0; 

} 

i=0; 

while (uq-lq>dq && i<20) { 
tmp.qty = (uq+lq)/2; 
i++; 

if(debug()) 

getPlayer ( ) .log ("\=°+W q=" + tmp.qty ) ; 
if (util (tmp) <0) 

uq= tmp.qty; 
else 

Iq = tmp.qty ; 

Example of Agent Strategy 

Figure 12 (b) 
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} 

/ / need this in case the above loop is just outside the budget 
while (utU ( tmp) <0 && tmp.qty>0 && i <40) { 
i++; 

tmp.qty -=dq; 
if(debug()) 
getPlayer (). log C^+i*- q=°+tmp.qty) ; 

} 

if(debug()) 

getPlayer ( ) .log f"+l+" steps. range=[ "+lq+" , "+uq+" ] currentbid="+ 
1232 getBidder().anteBid().qty); 

tmp.price = getPlayer ( ) . getValuation ( ) .dval ( tmp.qty, tmp.qty+dq) ; 

double u = getPlayer ( ) .currentUtil ( ) ; 
double newu = util (tmp) ; 

if (debug ()) { 
getPlayer ( ) .log Ccurrerrtalloc= n + 

getBidder ( ) .currentAllocation ( ) + 
" newbid="+tmp+" antebid="+ 
getBidder ( ) .antBid ( ) ) ; 
getPlayer ( ) .log ("u= ,, +u+ n newu="+newu+" ante=" 
+util (getBidder ( ) .anteBid ( ) ) 
+" fee="+getBidder ( ) .bidFee ( ) 
+• epsilon="+epsilon ( ) 
+" avgdur="+getAvgDuration ( ) ) ; 

} 

if (getPlayer (). trace ()) { 
Bid alloc = getBidder ( ) .currentAllocation ( ) ; 

getPlayer ( ) .log (" n +getBidder ( ) .anteBid ( ) .qty 

+°\f +getBidder ( ) .anteBid ( ) .price+°\f + 
alloc.qty+"\f+alloc.price+ 0 \f+ 
getPlayer ( ) .currentUtil ( ) ) ; 

} 

r 

if ( newu > u + epsilon ( ) ) { 

if (debug ( ) ) getPlayer ( ) .addnews ( 0 * " ) ; 
return getBidder ( ) .setBid (tmp.qty, tmp.price) ; 

} 

else { 

if (debug ()) getPlayer () .addnews (■-■) ; 
v return false ; 
} 

} 

* Example of Agent Strategy 

Figure 12(c) 
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r 

* BidList object 
* 

* File: PSPBidList.java 

* 
* 
* 
* 

*/ 

package ihn.diffpex; 

import ihn.merkato.Bid; 
import ihn.merkato.Data; 

* 
* 

*/ 

public class PSPBidList extends ihn.merkato.GenericBidList { 

I** 

* Compute an allocation given the current profile of opponents 

in 

* this bidiist. This class uses the Progressive-Second-Price 

* auction rule. 

* @param tb The bid for which the allocation is to be calculate 

d. 

* ©pararn Q The total quantity of resource available. 
*/ 

public Bid allocation (Bid tb, double Q) { 
return PSPallocation (tb, Q) ; 

} 

I** 

* Cumpute an allocation given the current profile of opponents 

in 

* this bidiist, with the Progressive-Second-Price 

* auction rule. 

* @param tb The bid for which the allocation is to be calculate 

Figure 13(a) 
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d. 

* @param Q The total quantity of resource available. 
*/ 

private synchronized Bid PSPallocation (Bid tb, double Q) { 
Bid index = top; 
Bid alloc= new Bid ( ) ; 

double leftover = Q; / /leftover with player id 
double leftoverwo =Q; / /leftover without player id 
double qAj , qAjO , num=0, den=0 ; 
boolean gotcha = false ; 

alloc.qty = Math.min (tb.qty, 

Math.max(Q-demandAtPrice(tb.price, tb.bid 

derid),0)); 

while(index.next != null) { 
index = index.next; 

if(index.bidderid != tb.bidderid) { 

if (index.price <= tb.price && Igotcha) { 
leftover -= tb.qty ; 
if (leftover <=0) leftover=0 ; 
gotcha = true ; 

} 

qAj = (index.qty <= leftover ? index.qty : leftover) ; 
qAjO= (index.qty <= leftoverwo ? index.qty : leftoverwo) ; 
num += (index.price* (qAjO- qaj) ) ; 
// den += ( qAjO- qAj) ; 

leftoverwo - = index.qty; 
leftover - = index.qty ; 
if (leftover <=0) leftover=0 ; 
if (leftoverwo <=0) leftoverwo=0 ; 

} 



} 

/ / if (Igotcha) alloc.qty = (tb.qty<= leftover ? tb.qty : 

Figure 13(b) 
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leftover) ; 

/ / ailoc.price = den>0 ? num/den : 0 ; 

alloc.price = alloc.qty>0 ? num/alloc.qty : 0 ; 
alloc.bidderid = tb.bidderid ; 

return alloc; 

} 

jirk 

* Bids with ID#0 are not counted. 
*/ 

public double revenue (double Q) { 
Bid index = top; 
double r=0 ; 
int l=0 ; 
Bid al ; 

while (index.next != null) { 
index = index.next; 

if (index.bidderid!=0) { 

al = allocation (index, Q) ; 
r+= al.qty*al.price; 

/ / value(index.bidderid,Q) ; 

} 

} 

return n 

// If (l==0) return 0; 

/ / else return r - (1-1) *value(Data.NOBODY, Q) ; 

} 



} 

/ / substituted 8 float to double 



Figure 13(c) 
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<?xml version "1.0° encoding =°UTF-8 n ?> 

- <GenericAuctionAgent 

(»ntext= D httprf/HOSTNAME:HTTP_PORT/bx/RESOURCE_NAME n > 
<Playerldentity name= a RESOURCE^USER tt passwd= n RESOURCE_PASSWD a 
ipaddress="127.0.0.1° netmask= n 255.255.255.25y l> 
-<PSPBidList> 

<param name= n randomduration n value= n false p l> 
<param name=°duration n value= n 60000° /> 
<param name=°mustconv n value= n true n /> 
<param name^bidfee* value="0.01° /> 
<param name= n capacity n value D 20000.0 H l> 
</PSPBidList> 

<UnixCryptAuthenticator passwdfile= ,, MERKATO_HOME/accounts/passwd ,, /> 

- <LinearValutaion> 

<parameter name=°qmax" value=°QMAXVAL° label= p QMAXJJNITS n /> 
parameter name= n vmax° va!ue= w VMAX.VAL° labe!=^MAX_UNITS° /> 
</LinearValuation> 

<paramname= n accountingDriverClass a 

value= n ihn.merkatoAccountManager" /> 
<param name^'accountFile" 

vaIue=°http://HOSTNAME:HTTP.PORT/bx/dbstub n /> 
<param name= n hwDriverC!ass° value=°RESOURCE J)RIVERCLASS n /> 
<param name= a hwDevice n value= n RESOURCEJ)RIVERJNir /> 
<param name= a maxNBids° value="100 n l> 
<param name= n verbose B value= n true D l> 
<param name= n rememberld$ n value=°false° l> 
<param name= n clientTimeouf value="60000 n l> 
<param name=°serverTimeouf value=°30000 tt £> 
<param name="pause n value^SOOO 0 /> 
<param name= n detailedlog n value^true" l> 
<pa^amete^name= ,, maxBidFee ,, value=°1.0 n label= n $° /> 
<parameter name= n maxAccountBalance° value=°10Q00.0 a label=T l> 
</6enericAuctionAgent> 



Figure 14 
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Wizard - Bid Widow 




Show 
auction 
graph 

Countdown timer for 
current auction. 
Reset whenever a 
new bid is received. 



Uploads agent 
to garage where 
it can continue 
bidding and exits 



Display 
account 
summary 
screen 



"Budget" is used to 
calculate price per unit 
bandwidth bid during 
auction. 



This is the last price offered 
with quantity desired bid 
(changes often so need to 
"Refresh"). 

Amount of bandwidth and 
extended price allocated to 
this agent during the last 
auction round 

Select the units for the 
display. Previously entered 
values will scale to the 
new units 



Display help. When checked, 
mouse-button presses bring up 
help rather than performing 
function 



"Stop" means to stop bidding. This bidding agent will not be charged and they will be 
placed in the shared best-effort queue. 

"Upload" uploads the configuration to the garaged agent. Not that this will change 
some advanced settings to those assumed by this simple valuation and strategy 
model. 

This simple budget-based valuation model has the bidding agent attempt to get as 

much bandwidth as possible without exceeding the budget number. 

The strategy is based on the formula: price-per-unit-bandwidth * bandwidth- 

allocated=total-price-paid 

Where the total-price-paid ("budget") is held constant, and the other two variables 
allowed to be altered. 

Following this strategy, the bidder will first attempt to get all the bandwidth the seller is 
offering for their budgeted amount, which works out to the lowest possible price-per- 
unit-bandwidth. If unsuccessful, the bidding agent gradually increases the the offered 
price-per-unit-bandwidth and decreases the desired amount of bandwidth, until they 
successfully win an allocation. 

If all bidders follow this valuation model, they will each get a bandwidth allocation that 
is the same proportion to total bandwidth as their budget is to the combined budgets 
of all bidders. 

From the "Help" screen" 

• Press Start to tell your agent to start bidding for you. 

• Press Stop to tell your agent to stop 



Figure 15(c) 
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"Resource Agent" Subscreen 

Selection screen for resource for which you are bidding 



n^demo-merkatomuctionmuyer/product/deniotf7S LDlxl 



File View Help 



Valid bidjfvugfyitedlkjotnutgoimyn876bu6jytfgo8ntb7o6r6b 
ALLOC 2.14 Mbps @ 156 12 oizdng;odrk 
netbpsmpoxfmpfotmhnpxnmcypnm 



I— Resource Agent - 



dzpgimzoir goeimrt woueirnc[AOmd 



| Java Applet Window 



Pull-down menu 
allows you to 
determine which 
resource you would 
like to bid on. 



Figure 15(f) 
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